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VACCINES 

The present invention relates to vaccines, and in particular to a method of 
identifying microbial polypeptides which are vaccine candidates. 

5 

The listing or discussion of a prior-published document in this specification 
should not necessarily be taken as an acknowledgement that the document is part 
of the state of the art or is common general knowledge. The documents listed in 
the specification are hereby incorporated by reference. 

10 

Microbial infections remain a serious risk to human and animal health, particularly 
in light of the fact that many pathogenic microorganisms, particularly bacteria, are 
or may become resistant to anti-microbial agents such as antibiotics. 

15 Vaccination provides an alternative approach to combating microbial infections, 
but it is often difficult to identify suitable immunogens for use in vaccines which 
are safe and which are effective against a range of different isolates of a 
pathogenic microorganism, particular a genetically diverse microorganism. 
Although it is possible to develop vaccines which use as the immunogen 

20 substantially intact microorganisms, such as live attenuated bacteria which 
typically contain one or mutations in a virulence-determining gene, not all 
microorganisms are amenable to this approach, and it is not always desirable to 
adopt this approach for a particular microorganism where safety cannot always be 
guaranteed. Also, some microorganisms express molecules which mimic host 

25 proteins, and these are undesirable in a vaccine. 

A particular group of microorganisms for which it is important to develop further 
vaccines is Neisseria meningitidis which causes meningococcal disease, a life 
threatening infection which in the Europe, North America, developing countries 
30 and elsewhere remains an important cause of childhood mortality despite the 
introduction of the conjugate serogroup C polysaccharide vaccine. This is because 
infections caused by serogroup B strains '(NmB), which express an a2-8 linked 
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polysialic acid capsule, are still prevalent. The term "serogroup" in relation to N 
meningitidis refers to the polysaccharide capsule expressed on the bacterium. The 
common serogroup in the UK causing disease is B 5 while in Africa it is A. 
Meningococcal septicaemia continues to carry a high case fatality rate; and 
5 survivors are often left with major psychological and/or physical disability. After a 
non-specific prodromal illness, meningococcal septicaemia can present as a 
fulminant disease that is refractory to appropriate anti-microbial therapy and full 
supportive measures. Therefore, the best approach to combating the public health 
menace of meningococcal disease is through prophylactic vaccination. 

10 

The non-specific early clinical signs and fulminant course of meningococcal 
infection mean that therapy is often ineffective. Therefore vaccination is 
considered the most effective strategy to diminish the global disease burden 
caused by this pathogen (Feavers (2000) ABC of meningococcal diversity. Nature 

15 404, 451-2). Existing vaccines to prevent serogroup A, C, W135, and Y N 
meningitidis infections are based on the polysaccharide capsule located on the 
surface of bacterium (Anderson et al (1994) Safety and immunogenicity of 
meningococcal A and C polysaccharide conjugate vaccine in adults. Infect Immun. 
62, 3391-33955; Leach et al (1997) Induction of immunologic memory in 

20 Gambian children by vaccination in infancy with a group A plus group C 
meningococcal polysaccharide-protein conjugate vaccine. J Infect Dis. 175, 200-4; 
Lieberman et al (1996). Safety and immunogenicity of a serogroups A/C Neisseria 
meningitidis oligosaccharide-protein conjugate vaccine in young children. A 
randomized controlled trial. J. American Med. Assoc. 275, 1499-1503). Progress 

25 toward a vaccine against serogroup B infections has been more difficult as its 
capsule, a homopolymer of ot2-8 linked sialic acid, is a relatively poor immunogen 
in humans. This is because it shares epitopes expressed on a human cell adhesion 
molecule, N-CAM1 (Finne et al (1983) Antigenic similarities between brain 
components and bacteria causing meningitis. Implications for vaccine 

30 development and pathogenesis. Lancet 2, 355-357). Indeed, generating immune 
responses against the serogroup B capsule might actually prove harmful. Thus, 
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there remains a need for new vaccines to prevent serogroup B K meningitidis 
infections. 

The most validated immunologic correlate of protection against meningococcal 
5 disease is the serum bactericidal assay (SBA). The SBA evaluates the ability of 
antibodies (usually IgG2a subclass) in serum to mediate complement deposition 
on the bacterial cell surface, assembly of the membrane attack complex, and 
bacterial lysis. In the SBA, a known number of bacteria are exposed serial 
dilutions of the sera with a defined complement source. The number of surviving 

10 bacteria is detennined, and the SBA is defined as the reciprocal of the highest 
dilution of serum that mediates 50% killing* The SBA is predictive of protection 
against serogroup C infections, and has been widely used as a surrogate for 
immunity against NmB infections. Importantly the SBA is a ready marker of 
immunity for the pre-clinical assessment of vaccines, and provides a suitable 

1 5 endpoint in clinical trials . 

Most efforts at NmB vaccine development are directed toward defining effective 
protein subunits. There has been a major investment in 'Reverse vaccinology', in 
which genome sequences are interrogated for potentially surface expressed 
20 proteins which are expressed as heterologous antigens and tested for their ability 
to generate meaningful responses in animals. However, this approach is limited by 
1) the computer algorithms for predicting surface expressed antigens, 2) failure to 
express many of potential immunogens, and 3) the total reliance on murine 
immune responses. 

25 

The key to a successful vaccine is to define antigen(s) that elicit protection against 
a broad range of disease isolates irrespective of serogroup or clonal group. An 
object of the invention is to use a novel genetic screening method (which we have 
termed Genetic Screening for Immunogens or GSI) to isolate antigens that are 
30 conserved across the genetic diversity of microbial strains and this is exemplified 
in relation to meningococcal strains. This may be done by identifying microbial 
antigens, such as N meningitidis antigens, by GSI as described in more detail 
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below; and validated by assessing the function of the immune response elicited by 
the recombinant antigens and by evaluating the protective efficacy of antigens. 
Although genetics has been applied in the search for vaccine candidates 
previously, it has hitherto been difficult to establish high throughput analyses, and 
5 has been difficult to differentiate between immunogenic and protective antigens. 

A first aspect of the invention provides a method for identifying a polypeptide of a 
microorganism which polypeptide is associated with an immune response in an 
animal which has been subjected to the microorganism, the method comprising the 
10 steps of 

(1) providing a plurality of different mutants of the microorganism; 

(2) contacting the plurality of mutant microorganisms with antibodies from 
15 an animal which has raised an immune response to the microorganism or 

a part thereof, under conditions whereby if the antibodies bind to the 
mutant microorganism the mutant microorganism is killed; 

(3) selecting surviving mutant microorganisms from step (2); 

20 

(4) identifying the gene containing the mutation in any surviving mutant 
microorganism; and 

(5) identifying the polypeptide encoded by the gene. 

25 

The immune response with which the polypeptide is associated is a functionally 
important one in the sense that it is one that may be associated with combating 
infection of the animal by the microorganism. 

30 The microorganism may be any microorganism, such as a bacterium or yeast It is 
preferred if the microorganism is a pathogenic microorganism, and particularly a 
pathogenic bacterium such as Neisseria meningitidis which causes meningococcal 
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disease, or Neisseria gonorrhoeae which causes gonorrhoea, or Haemophilus 
influenzae which causes at least one type of influenza and middle ear infection. 

By "polypeptide associated with an immune response in an animal which has been 
5 subjected to the microorganism" we include any such polypeptide. The method of 
the invention is able to identify polypeptides of microorganisms which 
polypeptides are ones for which antibodies have been raised in an animal when the 
immune system of the animal has been in contact with the polypeptide of the 
microorganism. Typically, the polypeptide is one which is expressed on the 
10 surface of the microorganism. The immune response of the animal is an antibody 
response, typically an IgG response. 

In one embodiment of the invention, the animal has been subjected to the 
microorganism by way of infection with the microorganism, for example a natural 

15 infection* Thus, the animal is typically a host for the microorganism. In another 
embodiment of the invention, the animal has purposefully been inoculated with 
the microorganism (whether live or killed) or part thereof. Either way, the 
animal's immune response has given rise to antibodies directed at the 
microorganism, some of which are selective for particular polypeptides of the 

20 microorganism and which can be used to identify polypeptides by the method of 
the invention. 

It will be appreciated that the term "animal" includes human and in a particularly 
preferred embodiment of the invention, as discussed in more detail below, the 
25 antibodies used in step (2) are ones from a human who is or has been infected with 
a microorganism or has been immunised with part of the microorganism. 

By using antibodies from the animal which has raised an immune response to the 
microorganism or part thereof, immunologically relevant polypeptides may be 
30 identified (in the context of immunogenicity and vaccine design, and particularly 
polypeptides which are protective). 
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Thus, it will be appreciated that the polypeptides identified using the method of 
the invention are antigens or immunogens (these terms being used interchangeably 
in the specification), typically surface exposed, which are ones that give rise to an 
immune response in an animal and so are vaccine candidates. 

5 

The plurality (or library) of different mutant microorganisms typically is a 
sufficient large number to give a high chance (typically >95%) of each gene 
within the microorganism being mutated. The number of mutants required will 
depend on the number of genes in the genome of the microorganism. When the 
10 mutants are random mutants, the number of mutants required to give a high 
chance of a mutant of each gene within the genome being represented is around 1 0 
times the number of genes. Typically, therefore, the number of mutant 
microorganisms provided in step (1) is around 10 to 20 times the number of genes 
in the microorganism. 

15 

Typically, a bacterium has from 500 to 5000 genes, and so the number of random 
mutants used is of the order of 5000 to 100,000. In the case of N. meningitidis a 
suitable number of random mutants is around 40, 000. 

20 The random mutants may be any type of random mutant such as those induced 
chemically; preferably, the mutants are insertion mutants, such as transposon 
mutants, since it is straightforward to identify the position of the insertions (eg 
using probes or PCR primers which are selective for the inserted element eg 
transposon), and typically the transposon carries an antibiotic resistance marker 

25 which allows selection of the mutants containing the transposon. 

Transposons suitable for integration into the genome of Gram negative bacteria 
include TnJ, TnlO and derivatives thereof. Transposons suitable for integration 
into the genome of Gram positive bacteria include TnPid" and derivatives or 
30 analogues thereof. Transposons particularly suited for use with Staphylococcus 
aureus include Tn917 (Cheung et al (1992) Proc. Natl Acad. Set USA 89, 6462- 
6466) and Tn918 (Albus et al (1991) Infect Immun. 59, 1008-1014). 
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It is particularly preferred if the transposons have the properties of the TnP77 
derivatives described by Camilli et al (1990) J. Bacterial 172, 3738-3744, and are 
carried by a temperature-sensitive vector such as pE194Ts (Villafane et al (1987) 
5 J. Bacteriol 169, 4822-4829). 

For TV", meningitidis, TnlO is a preferred transposon (see Sim et al (2000) Nature 
Med. 6, 1269-1273), although any transposon and transposase with in vitro 
activity can be used. 

10 

It will be appreciated that although transposons are convenient for insertionally 
inactivating a gene, any other known method, or method developed in the future 
may be used. A farther convenient method of insertionally inactivating a gene, 
particularly in certain bacteria such as Streptococcus, is using insertion- 
15 duplication mutagenesis such as that described in Morrison et al (1984) 
J.Bacteriol 159, 870 with respect to S. pneumoniae. The general method may also 
be applied to other microorganisms, especially bacteria. 

For fungi, insertional mutations are created by transformation using DNA 
20 fragments or plasmids preferably carrying selectable markers encoding, for 
example, resistance to hygromycin B or phleomycin (see Smith et al (1994) Infect. 
Immunol 62, 5247-5254). Random, single integration of DNA fragments 
encoding hygromycin B resistance into the genome of filamentous fungi, using 
restriction enzyme mediated integration (REMI; Schiestl & Petes (1991); Lu et al 
25 (1994) Proc. Natl Acad Set USA 91, 12649-12653) are known. 

A simple insertional mutagenesis technique for a fungus is described in Schiestl & 
Petes (1994) incorporated herein by reference, and include, for example, the use of 
Ty elements and ribosomal DNA in yeast. 



30 
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Random integration of the transposon or other DNA sequence allows isolation of a 
plurality of independently mutated microorganisms wherein a different gene is 
insertionally inactivated in each mutant. 

5 For some microorganisms, libraries of mutants in which each gene is mutated by a 
transposon or other insertion element are known. In this case, the plurality of 
microorganisms may conveniently be produced by pooling one or more 
representatives of each member of the library. For example, a comprehensive 
transposon library for Pseudomonas aeruginosa is described in Jacobs et al (2003) 
10 Proc. Natl Acad Set USA 100, 14339-14344. 

In step (2) of the method, the plurality of mutant microorganisms is contacted with 
antibodies from the animal which has raised as immune response to the 
microorganism or part thereof. The antibodies may be in any suitable form and 

15 from any suitable, convenient source from the animal (including human). 
Typically, the antibodies are present in serum derived from the animal. However, 
they may be present in other forms, such as a fraction enriched for IgG. It is 
preferred if the antibodies are IgG antibodies, but other antibody types may be 
used, such as IgA and IgM. Although it is preferred if the antibodies are present 

20 in or derived from serum, the antibodies may be present in or derived from other 
body fluids such as saliva. 

The antibodies are typically from an animal which is or has been infected with the 
microorganism. One of the advantages of this embodiment of the method is that it 

25 makes use of antibodies from an animal which has raised a relevant immune 
response in attempting to combat the infection with the microorganism, and such 
antibodies are likely to bind to polypeptides which are useful in vaccines. Thus, 
preferred antibodies are ones which are from humans who are or who have been 
infected with the microorganism, or who have been inoculated with an attenuated 

30 (eg vaccine) strain of the microorganism or who have been vaccinated with a 
vaccine which, comprises a part of the microorganism (such as outer membrane 
component parts). Typically, the antibodies used in step (2) of the method are 
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from an animal (such as man) which has raised a protective response against the 
microorganism. 

Alternatively, the antibodies are from an animal, such as an experimental animal 
such as mouse, rabbit, sheep or horse, which has been inoculated with the 
microorganism and allowed to generate an immune response, preferably a 
protective immune response. Whether or not a protective response has been raised 
may be determined by challenging the animal with live bacteria after inoculation. 
The experimental animals may have been inoculated with a virulent, pathogenic 
strain of the microorganism, or it may have been inoculated with an avirulent or 
attenuated strain (whether live or killed). 

In a preferred embodiment, the antibodies are raised to a strain of microorganism 
"heterologous" to the strain used to produce the mutant microorganism. Many 
pathogenic microorganisms exist in different serogroups or strains, and each 
serogroup or strain may have polypeptides in common with other serogroups or 
strains as well as polypeptides which are unique to the serogroup or strains. The 
advantage of using antibodies raised to one or more heterologous strain(s) is that it 
increases the chances of identifying a polypeptide which is common to all 
serogroups of the microorganism (ie conserved, common epitopes). Such 
polypeptides (or fragments or variants or fusions thereof) are more likely to be 
effective against the range of serogroups of a particular microorganism. Thus, in a 
particularly preferred embodiment where the microorganism is N. meningitidis, 
the plurality of mutant microorganisms are derived from a parent serogroup B 
strain, whereas the antibodies are derived from an animal (such as man) which has 
raised a response to a serogroup A and/or a serogroup C strain. It will be 
appreciated that antibodies may be pooled from more than one source. For 
example, serum from a patient infected with (or convalescing from an infection 
with) serogroup A strain may be pooled with serum from a patient infected with 
(or convalescing from an infection with) serogroup C strain. Serum from a patient 
infected with (or convalescing from an infection with) serogroup B strain may also 
be pooled. Some microorganisms have, in addition to polypeptide components of 
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their cell wall or cell membrane, polysaccharide components which often are 
immunogenic. In a further preferred embodiment, it is convenient to use a strain 
of the microorganism in which some or all of the polysaccharide components have 
been eliminated as the strain against which antibodies are produced. Thus, many 
5 bacteria have a capsule made predominantly of polysaccharide, but typically 
strains exist in which the capsule is missing. These "capsule minus" strains may 
conveniently be used to raise antibodies for use in the second step of the method. 

In relation to K meningitidis , the antibodies may conveniently be present in the 
10 following serum sources: from mice immunized by the systemic route using 
heterologous strains of K meningitidis (ie heterologous to the mutant strain used 
in the selection); from both acute and convalescent human patients infected with 
N. meningitidis', and from human patients immunized with defined outer 
membrane vesicles (OMVs) vaccine derived from the serogroup B NmB isolate 
15 H44/76. Convalescent sera is preferred since the patient will have raised a 
significant immune response to the infecting bacteria. In some circumstance, it 
may be useful to use acute patient serum as a control for the convalescent serum 
since acute patients may not have raised such a significant immune response. 
Equivalent sources of antibodies are available with respect to other 
20 microorganisms. 

Conditions are provided so that when the antibodies bind to the mutant 
microorganism, the mutant microorganism is killed, whereas when the antibodies 
do not bind to the mutant microorganism, the mutant microorganism is not killed. 

25 In this way, it can be seen that those mutant microorganisms in which a gene 
encoding a polypeptide which binds to the antibody is mutated so that the 
antibody no longer binds survive. This provides a very powerful selection for 
such mutants, and facilitates the identification of the polypeptides which are 
associated with an immune response in an animal infected with microorganism. 

30 As a control, it may be convenient to use wild-type microorganisms which are also 
killed under the given conditions, or to develop conditions in which all wild type 
micro organisms are killed. 
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Conveniently, once the antibodies have been contacted with the mutant 
microorganism a source of complement is added, such as complement from 
human, rabbit, mouse, sheep and horse. Conveniently, the complement is from 

5 the same source (ie species of animal) as the antibody. Antibodies (generally IgG 
2a subclass) mediate complement deposition on the surface of the microorganism, 
assembly of the membrane attack complex and lysis of the microorganism. 
Complement-mediated killing is independent of the presence of cells from the 
blood, but requires the presence of serum. Complement-mediated killing may be 

10 inactivated by heating the serum. 

Preferably, in this embodiment, the microorganism is a bacterium, either Gram 
positive or Gram negative. Complement mediated killing is described in Walport 
(2001) K Engl J. Med. 344, 1140-1144, and Walport (2001) K Eng J. Med. 344, 
15 1058-1066. 

The complement deposition approach to killing the microorganisms which retain 
the ability to bind to the antibodies is particularly suited to use with N. 
meningitidis since the serum bactericidal assay (SBA; see Goldschneider et al 

20 (1969) J. Exp. Med, 129, 1307-1326), which is based on the same principle, is 
used as discussed in the introduction. Of course, in the case of the SBA, the 
number of surviving bacteria is used to assess the effectiveness of serum in killing 
bacteria (and using this as a marker of the degree of protection conferred by the 
strain used to give rise to the antibodies use). As far as the inventors are aware 

25 there has never been any suggestion that this method could be adapted to 
identifying antigens in microorganisms. 

As an alternative to using complement to kill the cells to which the antibodies 
bind, a moiety may be used which binds selectively to the antibodies (which bind 
30 the cell) and delivers a cytotoxic agent to the cell. For example, the moiety may 
be a further antibody which recognizes the antibodies bound to the microorganism 
and delivers the cytotoxic agent to the cell. Thus, the further antibody may be an 
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anti-human antibody when the antibody which binds to the mutant microorganism 
is a human antibody. The cytotoxic agent may be directly cytotoxic or it may be 
indirectly cytotoxic. By indirectly cytotoxic we include an enzyme that is capable 
of activating a relatively non-toxic compound to a cytotoxic compound. A similar 
5 technique has been used to target tumour cells using tumour-selective antibodies 
and has been called ADEPT (antibody-directed enzyme prodrug therapy; see WO 
88/07378; Bagshawe (1987) Br. J. Cancer 56, 531-532; Bagshawe et a! (1988) Br. 
J. Cancer 58, 700-703; and Senter et a! (1988) Proc. Natl Acad. Set USA 85, 
4842-4846, all of which are incorporated herein by reference). 

10 

Enzyme - prodrug pairs include the following: Alkaline phosphatase useful for 
converting phosphate-containing prodrugs into free drugs, aryl sulphatase useful 
for converting sulphate-containing prodrugs into free drugs, cytosine deaminase 
useful for converting non-toxic 5-fluorocytosine into the anticancer drug 5- 

15 fluorouracil, proteases such as Serratia protease, thermolysin, subtilisin, carboxy- 
peptidases and cathepsins that are useful for converting peptide-containing 
prodrugs into free drugs, D-alanylcarboxypeptidases, useful for converting 
prodrugs that contain D-amino acid substituents, carbohydrate-enzymes such as p- 
galactosidase and neuraminidase useful for converting glycosylated prodrugs into 

20 free drugs, (3-lactamase useful for converting drugs derivatized with (3-lactams 
into free drugs and penicillin amidases useful for converting drugs derivatized at 
their amine nitrogens with phenoxyacetyl or phenylacetyl groups into free drugs. 

Other enzymes and prodrugs include hydrolases, amidases, sulphatases, lipases, 
25 glucuronidases, phosphatases and carboxypeptidases, and prodrugs be prepared 
from any of the various classes of anti-tumour compounds for example all<ylating 
agents (nitrogen mustards) including cyclophosphamide, bisulphan, chlorambucil 
and nitrosoureas; intercalating agents including adriamycin and dactinomycin; 
spindle poisons such as vinca alkaloids; and anti-metabolites including anti- 
30 folates, anti-purines, anti-pyrimidines or hydroxyurea. 
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Also included are cyanogenic prodrugs such as amygdalin which produce cyanide 
upon action with a carbohydrate cleaving enzyme. 

Mutant microorganisms which survive the conditions, for example those 
5 conditions which kill all wild type (or parent) microorganisms (and indeed the 
majority of mutants), are selected for further study since such mutant 
microorganisms are likely to be mutated in a gene which encodes a polypeptide 
which binds to the antibodies (and therefore is involved in an immune response). 
In one embodiment, and in order to confirm that the mutations in the surviving 
10 mutants are responsible for conferring the ability to withstand killing, the mutation 
of each mutant may backcrossed into the parental strain and the ability of the 
backcrossed mutant to survive the killing conditions confirmed. 

The gene containing the mutation is identified using methods well known in the 

15 art. For example, when the mutation is an insertion mutation, it is convenient to 
sequence from the insertion into the flanking DNA of the microorganism. When a 
transposon has been used to create the mutant microorganisms, it is convenient to 
identify the gene containing the transposon mutation by digesting genomic DNA 
from the individual mutant selected in step (3) with a restriction enzyme which 

20 cuts outside the transposon, ligating size-fractionated DNA containing the 
transposon into a plasmid, and selecting plasmid recombinants on the basis of 
antibiotic resistance conferred by the transposon and not by the plasmid. The 
microorganism genomic DNA adjacent to the transposon may be sequenced using 
two primers which anneal to the terminal regions of the transposon, and two 

25 primers which anneal close to the polylinker sequences of the plasmid. The 
sequences may be subjected to DNA database searches to determine if the 
transposon has interrupted a known gene. Thus, conveniently, sequence obtained 
by this method is compared against the sequences present in the publicly available 
databases such as EMBL and GenBank, or a complete genome sequence, if 

30 available. 
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By "gene" we include not only the regions of DNA that code for a polypeptide but 
also regulatory regions of DNA such as regions of DNA that regulate 
transcription, translation and, for some microorganisms, splicing of RNA. Thus, 
the gene includes promoters, transcription terminators, ribosome-binding 
5 sequences and for some organisms introns and splice recognition sites. 

Typically, sequence information of the identified gene obtained in step 4 is 
derived. Conveniently, sequences close to the ends of the transposon are used as 
the hybridisation site of a sequencing primer. The derived sequence or a DNA 
10 restriction fragment adjacent to the inactivated gene itself is used to make a 
hybridisation probe with which to identify, and isolate from a wild-type organism, 
the corresponding wild type gene. 

It is preferred if the hybridisation probing is done under stringent conditions to 
15 ensure that the gene, and not a relative, is obtained, at least when identifying the 
gene. 

The gene may be sequenced using standard methods and the polypeptide encoded 
by the gene identified, for example by translating the coding sequence of the gene, 
20 or the sequence may already be available as part of a sequenced microorganism 
genome. 

As described in more detail in the Example, particular genes identified by the 
method of the invention are the NBM0341 (TspA), NMB0338, NMB1345, 

25 NMB0738, NBM0792 (NadC family), NMB0279, NMB2050, NMB1335 (CreA), 
NMB2035, NMB1351 (Fmu and Fmv), NMB1574 (IIvC), NMB1298 (rsuA), 
NMB1856 (LysR family), NMB0119, NMB1705 (rfak), NMB2065 (HemK), 
NMB0339, NMB0401 (putA), NMB1467 (PPX), NMB2056, NMB0808, 
NMB0774 (upp), NMA0078, NMB0337 (branched-chain amino acid transferase), 

30 NMB0191 (Par A family), NMB1710 (glutamate dehydrogenase (gdliA), 
NMB0062 (rfbA-1) and NMB1583 (hisB) genes of Neisseria meningitidis. The 
genome sequence for N. meningitidis is available, for example from The Institute 



WO 2005/060995 PCT/GB2004/005441 

15 

of Genome Research (TIGR); www.tigr.org. Although these genes form part of 
the genome that has been sequenced, as far as the inventors are aware, they have 
not been isolated, the polypeptides they encode have not been produced, and there 
is no indication that the polypeptides they encode may be useful as a component 
5 of a vaccine. 

Thus, the invention includes the isolated genes as above and in the Examples and 
variants and fragments and fusions of such variants and fragments, and includes 
the polypeptides that the genes encode as described above, along with variants and 

10 fragment thereof, and fusions of such fragments and variants. Variants, fragments 
and fusions are described in more detail below. Preferably, the variants, 
fragments and fusions of the given genes above are ones which encode a 
polypeptide which gives rise to neutralizing antibodies against N. meningitidis. 
Similarly, preferably, the variants, fragments and fusions of the polypeptide whose 

15 sequence is given above axe ones which gives rise to neutralizing antibodies 
against N. meningitidis. The invention also includes isolated polynucleotides 
encoding the polypeptides whose sequences are given in the Example (preferably 
the isolated coding region) or encoding the variants, fragments or fusions. The 
invention also includes expression vectors comprising such polynucleotides and 

20 host cells comprising such polynucleotides and vectors (as is described in more 
detail below). The polypeptides described in the Examples are antigens identified 
by the method of the invention. 

Molecular biological methods for use in the practice of the method of the 
25 invention are well known in the art, for example from Sambrook & Russell (2001) 
Molecular Cloning, a laboratory manual, third edition, Cold Spring Harbor 
laboratory Press, Cold Spring Harbor, New York, incorporated herein by 
reference. 

30 It will be appreciated that the invention also includes carrying out steps (1) to (4) 
of the method (but not necessarily step (5)) so that a gene encoding a polypeptide 
which is associated with an immune response in an animal which has been 
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subjected to the microorganism is identified. The gene may be cloned and 
sequenced or may be isolated or synthesised, for example by using the polymerase 
chain reaction using primers based on its sequence. Variants of the gene may be 
made, for example by identifying related genes in other microorganisms or in 
5 other strains of the microorganism, and cloning, isolating or synthesizing the gene. 
Typically, variants of the gene axe ones which have at least 70% sequence 
identity, more preferably at least 85% sequence identity, most preferably at least 
95% sequence identity with the genes isolated by the method of the invention. Of 
course, replacements, deletions and insertions may be tolerated. The degree of 
10 similarity between one nucleic acid sequence and another can be determined using 
the GAP program of the University of Wisconsin Computer Group. 

Variants of the gene are also ones which hybridise under stringent conditions to 
the gene. By "stringent" we mean that the gene hybridises to the probe when the 
15 gene is immobilised on a membrane and the probe (which, in this case is >200 
nucleotides in length) is in solution and the immobilised gene/hybridised probe is 
washed in 0.1 x SSC at 65°C for 10 min. SSC is 0.15 M NaCl/0.015 M Na 
citrate. 

20 Fragments of the gene (or the variant gene) may be made which are, for example, 
20% or 30% or 40% or 50% or 60% or 70% or 80% or 90% of the total of the 
gene. Preferred fragments include all or part of the coding sequence. The variant 
and fragments may be fused to other, unrelated, polynucleotides. 

25 Thus the invention also includes a method for making a polynucleotide the method 
comprising carrying out steps (1) to (4) of the method of the invention and 
isolating or synthesising the identified gene or a variant or fragment thereof or a 
fusion of such gene or variant or fragment. The invention also includes a 
polynucleotide obtainable or obtained by this method. 

30 
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Preferably, the polynucleotide encodes a polypeptide which is immunogenic and 
is reactive with the antibodies from an animal which has been subjected to the 
microorganism from which the gene was identified. 

5 The invention also includes a method of selecting a microorganism mutated in a 
gene encoding a polypeptide which is associated with an immune response in an 
animal which has been subjected to the microorganism. This method comprises 
carrying out steps (1) to (3) of the method of the invention (whether or not steps 
(4) and (5) are carried out). The invention also included a mutant microorganism 
10 obtainable or obtained by this method which is not able to bind to the antibodies. 

Although as discussed above the method of the invention is useful in identifying 
genes and selecting mutant microorganisms, it is particularly preferred if the 
method is used to identify polypeptides from a microorganism which are 
15 associated with an immune response. Once identified, it is desirable to make an 
antigen based on the polypeptide. 

The antigen may be the polypeptide as encoded by the gene identified, and the 
sequence of the polypeptide may readily be deduced from the gene sequence. In 
20 further embodiments, the antigen may be a fragment of the identified polypeptide 
or may be a variant of the identified polypeptide or may be a fusion of the 
polypeptide or fragment or variant. 

Fragments of the identified polypeptide may be made which are, for example, 
25 20% or 30% or 40 % or 50% or 60% or 70% or 80% or 90% of the total of the 
polypeptide. Typically, fragments are at least 10, 15, 20, 30, 40 , 50, 100 or more 
amino acids, but less than 500, 400, 300 or 200 amino acids. Variants of the 
polypeptide may be made. By "variants" we include insertions, deletions and 
substitutions, either conservative or non-conservative, where such changes do not 
30 substantially alter the normal function of the protein. By "conservative 
substitutions" is intended combinations such as Gly, Ala; Val, lie, Leu; Asp, Glu; 
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Asn, Gin; Ser, Thr; Lys, Arg; and Phe 5 Tyr. Such variants may be made using the 
well known methods of protein engineering and site-directed mutagenesis. 

A particular class of variants are those encoded by variant genes as discussed 
above, for example from related microorganisms or other strains of the 
microorganism. Typically the variant polypeptides have at least 70% sequence 
identity, more preferably at least 85% sequence identity, most preferably at least 
95 % sequence identity with the polypeptide identified using the method of the 
invention. 

The percent sequence identity between two polypeptides may be determined using 
suitable computer programs, for example the GAP program of the University of 
Wisconsin Genetic Computing Group and it will be appreciated that percent 
identity is calculated in relation to polypeptides whose sequence has been aligned 
optimally. 

The alignment may alternatively be carried out using the Clustal W program 
(Thompson et ah 9 (1994) Nucleic Acids Res 22, 4673-80). The parameters used 
may be as follows: 

Fast pairwise alignment parameters: K-tuple(word) size; 1, window size; 5, gap 
penalty; 3, number of top diagonals; 5. Scoring method: x percent. 
Multiple alignment parameters: gap open penalty; 10, gap extension penalty; 0.05. 
Scoring matrix: BLOSUM. 

The fusions may be fusions with any suitable polypeptide. Typically, the 
polypeptide is one which is able to enhance the immune response to the 
polypeptide it is fused to. The fusion partner may be a polypeptide that facilitates 
purification, for example by constituting a binding site for a moiety that can be 
immobilised in, for example, an affinity chromatography column. Thus, the 
fusion partner may comprise oligo-histidine or other amino acids which bind to 
cobalt or nickel ions. It may also be an epitope for a monoclonal antibody such as 
a Myc epitope. 
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The invention also includes, therefore, a method of making an antigen as 
described above, and antigens obtainable or obtained by the method. 

The polynucleotides of the invention may be cloned into vectors, such as 
expression vectors, as is well known on the art. Such vectors maybe present in 
host cells, such as bacterial, yeast, mammalian and insect host cells. The antigens 
of the invention may readily be expressed from polynucleotides in a suitable host 
cell, and isolated therefrom for use in a vaccine. 

Typical expression systems include the commercially available pET expression 
vector series and E. coli host cells such as BL21 . The polypeptides expressed may 
be purified by any method known in the art. Conveniently, the antigen is fused to 
a fusion partner that binds to an affinity column as discussed above, and the fusion 
is purified using the affinity column (eg such as a nickel or cobalt affinity 
column). 

It will be appreciated that the antigen or a polynucleotide encoding the antigen 
(such as a DNA molecule) is particularly suited for use as in a vaccine. In that 
case, the antigen is purified from the host cell it is produced in (or if produced by 
peptide synthesis purified from any contaminants of the synthesis). Typically the 
antigen contains less that 5% of contaminating material, preferably less than 2%, 
1%, 0.5%, 0.1%, 0.01%, before it is formulated for use in a vaccine. The antigen 
desirably is substantially pyrogen free. Thus, the invention further includes a 
vaccine comprising the antigen, and method for making a vaccine comprising 
combining the antigen with a suitable carrier, such as phosphate buffered saline. 
Whilst it is possible for an antigen of the invention to be administered alone, it is 
preferable to present it as a pharmaceutical formulation, together with one or more 
acceptable carriers. The carrier (s) must be "acceptable" in the sense of being 
compatible with the antigen of the invention and not deleterious to the recipients 
thereof. Typically, the carriers will be water or saline which will be sterile and 
pyrogen free. 
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The vaccine may also conveniently include an adjuvant. Active immunisation of 
the patient is preferred. In this approach, one or more antigens are prepared in an 
immunogenic formulation containing suitable adjuvants and carriers and 
5 administered to the patient in known ways. Suitable adjuvants include Freund's 
complete or incomplete adjuvant, muramyl dipeptide, the "Iscoms" of EP 109 942, 
EP 180 564 and EP 231 039, aluminium hydroxide, saponin, DEAE-dextran, 
neutral oils (such as miglyol), vegetable oils (such as arachis oil), liposomes, 
Pluronic polyols or the Ribi adjuvant system (see, for example GB-A-2 189 141). 
10 "Pluronic" is a Registered Trade Mark. The patient to be immunised is a patient 
requiring to be protected from infection with the microorganism. 

The aforementioned antigens of the invention (or polynucleotides encoding such 
antigens) or a formulation thereof may be administered by any conventional 
15 method including oral and parenteral (eg subcutaneous or intramuscular) injection. 
The treatment may consist of a single dose or a plurality of doses over a period of 
time. 

It will be appreciated that the vaccine of the invention, depending on its antigen 
20 component (or polynucleotide), may be useful in the fields of human medicine and 
veterinary medicine. 

Diseases caused by microorganisms are known in many animals, such as domestic 
animals. The vaccines of the invention, when containing an appropriate antigen or 
25 polynucleotide encoding an antigen, are useful in man but also in, for example, 
cows, sheep, pigs, horses, dogs and cats, and in poultry such as chickens, turkeys, 
ducks and geese. 

Thus, the invention also includes a method of vaccinating an individual against a 
30 microorganism, the method comprising administering to the individual an antigen 
(or polynucleotide encoding an antigen) or vaccine as described above. The 
invention also includes the use of the antigen (or polynucleotide encoding an 
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antigen) as described above in the manufacture of a vaccine for vaccinating an 
individual. 

The antigen of the invention may be used as the sole antigen in a vaccine or it may 
be used in combination with other antigens whether directed at the same or 
different disease microorganisms. In relation to N. meningitidis, the antigen 
obtained which is reactive against NmB may be combined with components used 
in vaccines for the A and/or C serogroups. It may also conveniently be combined 
antigenic components which provide protection against Haemophilus and/or 
Streptococcus pneumoniae. The additional antigenic components may be 
polypeptides or they may be other antigenic components such as a polysaccharide. 
Polysaccharides may also be used to enhance the immune response (see, for 
example, Makela et at (2002) Expert Rev. Vaccines 1, 399-410). 

It is particularly preferred in the above vaccines and methods of vaccination if the 
antigen is the polypeptide encoded by any of the genes as described above (and in 
the Examples), such as the NMB0338 gene, or a variant or fragment or fusion as 
described above (or a polynucleotide encoding said antigen), and that the disease 
to be vaccinated against is Neisseria meningitidis infection (meningococcal 
disease). 

The invention will now be described in greater detail by reference to the following 
non-limiting Examples and Figure wherein: 

Figure 1 is a schematic representation of a preferred embodiment of the method of 
the invention 

Example 1: Genetic screening for immunogens (GSI) in N. meningitidis 

The application of GSI in this example involves screening libraries of insertional 
mutants of K meningitidis for strains which are less susceptible to killing by 
bactericidal antibodies. 
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We have demonstrated the effectiveness of GSI by screening a library of mutants 
of the sequenced NmB isolate, MC58 5 with sera raised in mice against a capsule 
minus of the same strain. A total of 40,000 mutants was analysed with sera raised 
5 in mice by intraperitoneal immunisation with the homologous strain; the SBA of 
this sera is around 2,000 against the wild-type strain. Surviving mutants were 
detected when the library was exposed to serum at a 1 :5 60 dilution (which kills all 
wild-type bacteria). To establish whether the transposon insertion in the surviving 
mutants was responsible for the ability to withstand killing, the mutations were 

10 backcrossed into the parental strain, and the backcrossed mutants were confirmed 
as being more resistant to killing than the wild-type in the SBA. The sequence of 
the gene affected by the transposon was examined by isolating the transposon 
insertion site by marker rescue. We found that two of the genes affected were 
TspA and NMB0338. TspA is a surface antigen which elicits strong CD4+ T cell 

15 responses and is recognized by sera from patients (Kizil et al (1999) Infect Immun. 
67, 3533-41). NMB0338 is a gene of previously unknown function which 
encodes a polypeptide that is predicted to contain two transmembrane domains, 
and is located at the cell surface. The amino acid sequence encoded by NMB0338 
is: 

20 MERNGVFGKIVGNRILRMSSEHAAASYPKPCKS FKLAQSWFRVRSCLGGVFI YGA 
NMKLIYTVIKIIILLLFLLLAVINTDAVTFSYLPGQKFDLPLIVVLFGAFWGII 
FGMFALFGRLLSLRGENGRLRAEVKKNARLTGKELTAPPAQNAPESTKQP 

There are several practical advantages of using NmB for GSI aside from the public 
25 health imperative: a) the bacterium is genetically tractable; b) killing of the 
bacterium by effector immune mechanism is straightforward to assay; c) the 
genome sequences are available for three isolates of different serogroups and 
clonal lineages (IV-A, ET-5, and ET-37 for serogroups A, B, and C, respectively); 
and d) well-characterised clinical resources are available for this work. 

30 

GSI has two potential limitations. First, targets of bactericidal antibodies may be 
essential. This is unlikely as all known targets of bactericidal antibodies in NmB 
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are non-essential, and no currently licensed bacterial vaccine targets an essential 
gene product. Second, sera will contain antibodies to multiple antigens, and, loss 
of a single antigen may not affect the survival of mutants. We have already shown 
that even during selection with sera raised against the homologus strain, relevant 
antigens were still identified using appropriate dilutions of sera. 

The major advantages of GSI are that 1) the high throughput steps do not involve 
technically demanding or costly procedures (such as protein 
expression/purification and immunisation), and 2) human samples can be used in 
the assay rather than relying solely on animal data. GSI will rapidly pinpoint the 
subset of surface proteins that elicit bactericidal activity, allowing more detailed 
analysis of a smaller number of candidates. 

1 . Identification of targets of bactericidal antibodies using GSI 

Murine sera raised against heterologous strains, and human sera, are used to 

identify cross-reactive antigens. The sera are obtained from: 

i) mice immunised by the systemic route with heterologous strains: the 
strains will be selected and/or constructed to avoid isolates with the 
same immunotype and sub-serotype. 

ii) acute and convalescent sera from patients infected with known isolates 
of N. meningitidis (provided by Dr R. Wall, Northwick Park) 

iii) pre- and post-immunisation samples (provided by the Meningococcal 
Reference Laboratory) from volunteers receiving defined outer 
membrane vesicle (OMVs) vaccines derived from the NmB isolate, 
H44/76. 

Each of these sources of sera has specific advantages and disadvantages. 
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Serum source 


Advantages 


Disadvantages 


Murine 
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1) Animal source of 
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3) Acute and convalescent sera available 




Sera following 
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a) Sera from animals immunised with heterologous strains (ie the sequenced 
sero group A or C strains) are used in GSI to select the library of MC58 mutants. 
We have shown that immunisation with live, attenuated NrriB elicits cross-reactive 

5 bactericidal antibody responses against serogroup A and C strains. The antigen 
absent in mutants with enhanced survival in the face of human sera are identified 
by marker rescue of the disrupted gene. 

b) Mutations are identified that confer resistance against killing by 
10 heterologous sera, and it is determined whether the gene product is also a target 

for killing of the sequenced, serogroup A and C strains, Z2491 and FAM18 
respectively. The genome databases are inspected for homologues of the genes. If 
a homologue is present, the transposon insertion is amplified from the MC58 
mutant and introduced into the serogroup A and C strains by transformation. The 
15 relative survival of the mutant and wild-type strain of each serogroup are 
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compared. Thus, GSI can quickly give information whether the targets of 
bactericidal activity are conserved and accessible in diverse strains of N. 
meningitidis, irrespective of serogroup, immunotype and subserotype. 

5 c) Mutants with enhanced survival against sera raised in mice are tested using 
human sera from either convalescent patients or vaccinees receiving heterologous 
OMV vaccines (derived from H44/76). This addresses the important question of 
whether the targets are capable of eliciting bactericidal antibodies in human. With 
other vaccine approaches, this information is only gained at the late, expensive 
10 stage of clinical trials that requires GMP manufacture of vaccine candidates. 

The advantages are that GSI is a high-throughput analysis performed using simple, 
available techniques. Antigens which elicit bactericidal antibodies in humans and 
which mediate killing of multiple strains can be identified rapidly as GSI is 
15 flexible with respect to the bacterial strain and sera used. Mutants selected using 
human sera are analysed in the same way as those selected by murine sera. 

2. Assessment of the antibody response of recombinant GSI antigens 

20 Proteins which are targets of bactericidal antibodies that are recognised by sera 
from convalescent patients and vaccines are expressed in E. coli using 
commercially available vectors. The corresponding open reading frames are 
amplified by PGR from MC58, and ligated into vectors such as pCR Topo CT or 
pBAD/His, to allow protein expression under the control of a T7 or arabinose- 

25 inducible promoter, respectively. Purification of the recombinant proteins from 
total cellular protein is performed via the His Tag fused to the C terminus of the 
protein on a Nickel or Cobalt column. 

Adult New Zealand White rabbits are immunized on two occasions separated by 
30 four weeks by subcutaneous injection with 25 jag of purified protein with Freund's 
incomplete adjuvant. Sera from animals will be checked prior to immunisation for 
pre-existing anti-Afa? antibodies by whole cell ELISA. Animals which have an 
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initial serum titre of <1:2 are used for immunisation experiments. Post- 
immunisation serum are obtained two weeks after the second immunisation. To 
confirm that specific antibodies have been raised, pre- and post-immunisation 
serum is tested by i) Western analysis against the purified protein and ii) ELISA 
5 using cells from the wild-type and the corresponding mutant (generated by GSI). 

SBAs will be performed against MC58 (the homologous strain), and the 
sequenced serogroup A and C strains with the rabbit immune serum. The assay 
will be performed in triplicate on at least two occasions. SBAs of >8 will be 
10 considered significant. The results provide evidence of whether the protein 
candidates can elicit bactericidal antibodies as recombinant proteins. 

3 . Establishing the protective efficacy of GSI antigens 

15 All the candidates are tested for their ability to protect animals against live 
bacterial challenge as this allows any aspect of immunity (cellular or humoral) to 
be assessed in a single assay. We have established a model of active immunisation 
and protection against live bacterial infection. In this model, adult mice are 
immunised on days 0 and 21, and on day 28 receive live bacterial challenge of 10 6 

20 or 10 7 CFU of MC58 intraperitoneally in iron dextran (as the supplemental iron 
source). The model is similar to that described for evaluation of the protective 
efficacy of immunisation with Tbps Danve et al (1993) Vaccine 11, 1214-1220. 
Non-immunised animals develop bacteraemia within 4 hours of infection, and 
show signs of systemic illness by 24 hours. We have already been able to 

25 demonstrate the protective efficacy of both attenuated Nm strains and a protein 
antigen against live meningococcal challenge; PorA is an outer membrane protein 
that elicits bactericidal antibodies, but which is not a lead vaccine candidate 
because of extensive antigenic variation(Bart et al (1999) Infect Immun. 67, 3832- 
3846. 

30 

Six week old, BALB/c mice (group size, 35 animals) receive 25 jug of 
recombinant protein with Freund's incomplete adjuvant subcutaneously on days 
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day 0 and 21 , then are challenged with 10 6 (15 animals) or 10 7 (15 animals) CFU 
of MC58 intraperitoneally on day 28. Two challenge doses are used to examine 
the vaccine efficacy at a high and low challenge dose; sera are obtained on day 28 
from the remaining five animals in each group, and from five animals before the 

5 first immunisation and stored at -70°C for further mimunological assays. Animals 
in control groups receive either i) adjuvant alone, ii) recombinant refolded PorA, 
and iii) a live, attenuated Nm strain. To reduce the overall number of animals in 
control groups, sets of five candidates will be tested at one time (number of groups 
= 5 candidates + 3 controls). Survival of animals in the groups is compared by 

10 Mann Whitney U Test. With group sizes of 15 mice/dose, the experiments axe 
powered to show a 25% difference in survival between groups. 

For vaccines which show significant protection against challenge, a repeat 
experiment is performed to confirm the finding. Furthermore, to establish that 
15 vaccination with a candidate also elicits protection against bacteraemia, levels of 
bacteraemia are determined during the second experiment; blood is sampled at 22 
hr post-infection in immunised and un-immunised animals (bacteraemia is 
maximal at this time). The results are analysed using a two-tailed Student-T test to 
determine if there is a significant reduction in bacteraemia in vaccinated animals. 



20 
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Further materials and methods used 

Mutagenesis of Neisseria meningitidis 

5 For work with Neisseria meningitidis, mutants were constructed by in vitro 
mutagenesis. Genomic DNA from N. meningitidis was subjected to mutagenesis 
with a Tn5 derivative containing a marker encoding resistance to kanamycin, and 
an origin of replication which is functional in E. coli. These elements are bound 
by composite TnJ ends. Transposition reactions were carried out with a 
10 hyperactive variant of Tn5 and the DNA repaired with T4 DNA polymerase and 
ligase in the presence of ATP and nucleotides. The repaired DNA was used to 
transform N meningitidis to kanamycin resistance. Southern analysis confirmed 
that each mutant contained a single insertion of the transposon only. 

15 Serum bactericidal assays (SB As) 

Bacteria were grown overnight on solid media (brain heart infusion media with 
Levanthals supplement) and then re-streaked to solid media for four hours on the 
morning of experiments. After this time, bacteria were harvested into phosphate 
20 buffered saline and enumerated. SBAs were performed in a 1 ml volume, 
containing a complement source (baby rabbit or human) and approximately 10 5 
colony forming units. The bacteria were collected at the end of the incubation and 
plated to solid media to recover surviving bacteria. 

25 Isolating the transposon insertion sites 

Genomic DNA will be recovered from mutants of interest by standard methods 
and digested with Prall, EcoKV, and Dral for three hours, then purified by phenol 
extraction. The DNA will then be self-ligated in a 100 microlitre volume 
30 overnight at 16°C in the presence of T4 DNA ligase, precipitated, then used to 
transform E. coli to kanamycin resistance by electroporation. 
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Example 2: Further screening and results thereof 
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GSI has been used to screen a librae of approximately 40,000 insertional mutants 
of MC58. The library was constructed by in vitro Tn5 mutagenesis, using a 
5 transposon harbouring the origin of replication from p AC YC 1 84. 



MC58 was chosen as it is a sero group B isolate of N. meningitidis, and the 
complete genome sequence of this strain is known. 



10 The library is always screened in parallel with the wild-type strain as a control, 
and the number of colonies recovered from the library and the wild-type is shown. 



Selection with murine sera 



15 Initially the library was analysed using sera from animals immunised with the 
attenuated strain YH102. Adult mice (Balb/C) received 108 colony forming units 
intra-peritoneally on three occasions, and sera was collectd 1 0 days after the final 
immunisation, 



20 The screen identified several mutants with enhanced resistance to serum killing: 
This was confirmed by isolating individual mutants, reconstructing the mutation in 
the original genetic background, and re-testing the individual mutants for their 
susceptibility to complement mediated lysis against the wild-tye. The transposon 
insertions are in the following gene: 

25 

NMB0341 (TspA) DNA sequence 

ATGCCCGCCGGCCGACTGCCCCGCCGATGCCCGATGATGACGAAATTTACAGACTGTACG 
CGGTC7VAACCGTATTCAGCCGCCAACCCACAGGGGATACATCTTGAAAAACAACAGACAA 
ATCAAACTGATTGCCGCCTCCGTCGCAGTTGCCGCATCCTTTCAGGCACATGCTGGACTG 

30 GGCGGACTGAATATCCAGTCCAACCTTGACGAACCCTTTTCCGGCAGCATTACCGTAACC 
GGCGAAGAAGCCAAAGCCCTGCTAGGCGGCGGCAGCGTTACCGTTTCCGAAAAAGGCCTG 
ACCGCCAAAGTCCACAAGTTGGGCGACAAAGCCGTCATTGCCGTTTCTTCCGAACAGGCA 
GTCCGCGATCCCGTCCTGGTGTTCCGCATCGGCGCAGGCGCACAGGTACGCGAATACACC 
GCCATCCTCGATCCTGTCGGCTACTCGCCCAAAACCAAATCTGCACTTTCAGACGGCAAG 

35 AC AC AC C G CAAAAC C G C T C C G AC AG C AG AG T CC C AAGAAAAT C AAAAC G C C AAAGC C C T C 
C GC A2AACC G AT AAAAAAG AC AGC G C GAAC G C AGC C GT C AAAC C GGC AT AC AAC GG C AAA 
ACCCATACCGTCCGCAAAGGCGAAACGGTCAAACAGATTGCCGCCGCCATCCGCCCGAAA 
CACCTGACGCTCGAACAGGTTGCCGATGCGCTGCTGAAGGCAAACCCAAATGTTTCCGCA 
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CACGGCAGACTGCGTGCGGGCAGCGTGCTTCACATTCCGAATCTGAACAGGATCAAAGCG 
GAACAACCCAAACCGCAAACGGCGAAACCCAAAGCCGAAACCGCATCCATGCCGTCCGAA 
CCGTCCAAACAGGCAACGGTAGAGAAACCGGTTGAAAAACCTGAAGCAAAAGTTGCCGCG 
CCCGAAGCAAAAGCGGAAAAACCGGCCGTTCGACCCGAACCTGTACCCGCTGCAAATACT 
5 GCCGCATCGGAAACCGCTGCCGAATCCGCCCCCCAAGAAGCCGCCGCTTCTGCCATCGAC 
ACGCCGACCGACGAAACCGGTAACGCCGTTTCCGAACCTGTCGAACAGGTTTCTGCCGAA 
GT^AGAAACCGAAAGCGGACTGTTTGACGGTCTGTTCGGCGGTTCGTACACCTTGCTGCTT 
GCCGGCGGAGGCGCGGCATTAATCGCCCTGCTGCTGCTTTTGCGCCTTGCCCAATCCAAA 
CGCGCGCGCCGTACCGAAGAATCCGTCCCTGAGGAAGAGCCTGACCTTGACGACGCGGCA 

10 GACGACGGCATAGAAATCACCTTTGCCGAAGTCGAAACTCCGGCAACGCCCGAACCCGCT 
CCGAAAAACGATGTAAACGACACACTTGCCTTAGATGGGGAATCTGAAGAAGAGTTATCG 
GCAAAACAAACGTTCGATGTCGAAACCGATACGCCTTCCAACCGCATCGACTTGGATTTC 
GACAGCCTGGCAGCCGCGCAAAACGGCATTTTATCCGGCGCACTTACGCAGGATGAAGAA 
ACCCAAAAACGCGCGGATGCCGATTGGAACGCCATCGAATCCACAGACAGCGTGTACGAG 

15 CCCGAGACCTTCAACCCGTACAACCCTGTCGAAATCGTCATCGACACGCCCGAACCGGAA 
TCTGTCGCCCAAACTGCCGAAAACAAACCGGAAACCGTCGATACCGATTTCTCCGACAAC 
CTGCCCTCAAACAACCATATCGGCACAGAAGAAACAGCTTCCGCAAAACCTGCCTCACCC 
TCCGGACTGGCAGGCTTCCTGAAGGCTTCCTCGCCCGAAACCATCTTGGAAAAAACAGTT 
GCCGAAGTCCAAACACCGGAAGAGTTGCACGATTTCCTGAAAGTGTACGAAACCGATGCC 

20 GTCGCGGAAACTGCGCCTGAAACGCCCGATTTCAACGCCGCCGCAGACGATTTGTCCGCA 
TTGCTTCAACCTGCCGAAGCACCGTCCGTTGAGGAAAATATAACGGAAACCGTTGCCGAA 
ACACCCGACTTCAACGCCACCGCAGACGATTTGTCCGCATTACTTCAACCTTCTAAAGTA 
CCTGCCGTTGAGGAAAATGCAGCGGAAACCGTTGCCGATGATTTGTCCGCACTGTTGCAA 
CCTGCTGAAGCACCGGCCGTTGAGGAAAATGTAACGGAAACCGTTGCCGAAACACCCGAT 

25 TTCAACGCCACCGCAGACGATTTGTCCGCATTACTTCAACCTTCTGAAGCACCTGCCGTT 
GAGGAAAATGCAGCGGAAACCGTTGCCGATGATTTGTCCGCACTGTTGCAACCTGCTGAA 
GCACCGGCCGTTGAGGAAAATGCAGCGGAAATCACTTTGGAAACGCCTGATTCCAACACC 
TCTGAGGCAGACGCTTTGCCCGACTTCCTGAAAGACGGCGAGGAGGAAACGGTAGATTGG 
AGCATCTACCTCTCGGAAGAAAATATCCCAAATAATGCAGATACCAGTTTCCCTTCGGAA 

30 TCTGTAGGTTCTGACGCGCCTTCCGAAGCGAAATACGACCTTGCCGAAATGTATCTCGAA 
AT CGGC GACC GCGAT GCC GC T GC C GAGAC AG T GCAGAAAT T GCTGGAAGAAGCGGAAGGC 
GACGTACTCAAACGTGCCCAAGCATTGGCGCAGGAATTGGGTATTTGA 

NBM0341 Protein sequence 

35 MPAGRLPRRCPMMTKFTDCTRSNRIQPPTHRGYILKNNRQIKLIAASVAVAASFQAHAGL 
GGLNIQSNLDEPFSGSITVTGEEAKALLGGGSVTVSEKGLTAKVHKLGDKAVIAVSSEQA 
VRDPVLVFRIGAGAQVREYTAILDPVGYSPKTKSALSDGKTHRKTAPTAESQENQNAKAL 
RKTDKKDSANAAVKPAYNGKTHTVRKGETVKQIAAAIRPKHLTLEQVADALLKANPNVSA 
HGRLRAGSVLHIPWLNRIKAEQPKPQTAKPKAETASMPSEPSKQATVEKPVEKPEAKVAA 

40 PEAKAEKPAVRPEPVPAANTAASETAAESAPQEAAASAIDTPTDETGNAVSEPVEQVSAE 
EETESGLFDGLFGGSYTLLLAGGGAALIALLLLLRLAQSKRARRTEESVPEEEPDLDDAA 
DDGIEITFAEVETPATPEPAPKNDVNDTLALDGESEEELSAKQTFDVETDTPSNRIDLDF 
DSLAAAQNGILSGALTQDEETQKRADADWNAIESTDSVYEPETFNPYNPVEIVIDTPEPE 
SVAQTAENKPETVDTDFSDNLPSNNHIGTEETASAKPASPSGLAGFLKASSPETILEKTV 

45 AEVQTPEELHDFLKVYETDAVAETAPETPDFNAAADDLSALLQPAEAPSVEENITETVAE 
TPDFNATADDLSALLQPSKVPAVEENAAETVADDLSALLQPAEAPAVEENVTETVAETPD 
FNATADDLSALLQPSEAPAVEENAAETVADDLSALLQPAEAPAVEENAAEITLETPDSNT 
SEADALPDFLKDGEEETVDWSIYLSEENIPNNADTSFPSESVGSDAPSEAKYDLAEMYLE 
IGDRDAAAETVQKLLEEAEGDVLKRAQALAQELGI 

50 

NMB0338 DNA sequence 

ATGGAAAGGAACGGTGTATTTGGTAAAATTGTCGGCAATCGCATACTCCGTATGTCGTCC 
GAACACGCTGCCGCATCCTATCCGAAACCGTGCAAATCGTTTAAACTAGCGCAATCTTGG 
TTCAGAGTGCGAAGCTGTCTGGGCGGCGTTTTTATTTACGGAGCAAACATGT^AACTTATC 
55 TATACCGTCATCAAAATCATTATCCTGCTGCTCTTCCTGCTGCTTGCCGTCATTAATACG 
GATGCCGTTACCTTTTCCTACCTGCCGGGGCAAAAATTCGATTTGCCGCTGATTGTCGTA 
TTGTTCGGCGCATTTGTAGTCGGTATTATTTTTGGAATGTTTGCCTTGTTCGGACGGTTG 
TTGTCGTTACGTGGCGAGAACGGCAGGTTGCGTGCCGAAGTAAAGAAAAATGCGCGTTTG 
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ACGGGGAAGGAGCTGACCGCACCACCGGCGCAA^TGCGCCCGAATCTACCAAACAGCCT 
TAA 

NMB0338 Protein sequence 

5 MERNGVFGKIVGNRILRMSSEHAAASYPKPCKSFKLAQSWFRVRSCLGGVFIYGANMKLI 
YT VI KI 1 1 LLL FLLLAVI NT DAVT FS YL PGQKFDL PL I VVL FGAFVVG 1 1 FGMFAL FGRL 
LSLRGENGRLRAEVKKNARLTGKELTAPPAQNAPESTKQP 

Analysis of the polypeptide indicates that it is predicted to have two membrane 
10 spanning domains, from residues 54 to 70 and 88 to 107. Thus, fragments from 
the regions 1 to 53, and 108 to the end (C-tenninal) may be particularly useful as 
immunogens. 



NMB1345 DNA sequence 

15 ATGAAAAAACCTTTGATTTCGGTTGCGGCAGCATTGCTCGGCGTTGCTTTGGGCACGCCT 
TAT TAT TTGGGTGT C AAAGCC G AAGAAAGC T T G AC G C AGC AG C AAAAAAT AT T G C AG G AA 
ACGGGCTTCTTGACCGTCGAATCGCACCAATATGAGCGCGGCTGGTTTACCTCTATGGAA 
ACGACGGTCATCCGTCTGAAACCCGAGTTGCTGAATAATGCCCGAAAA.TACCTGCCGGAT 
AACCTGAAAACAGTGTTGGAACAGCCGGTTACGCTGGTTAACCATATCACGCACGGCCCT 

20 TTCGCCGGCGGATTCGGCACGCAGGCGTACATTGAAACCGAGTTCAAATACGCGCCTGAA 
ACGGAAAAAGTTCTGGAACGCTTTTTTGGAAAACAAGTCCCGGCTTCCCTTGCCAATACC 
GTTTATTTTAACGGCAGCGGTAAAATGGAAGTCAGTGTTCCCGCCTTCGATTATGAAGAG 
CTGTCGGGCATCAGGCTGCACTGGGAAGGCCTGACGGGAGAAACGGTTTATCAAAAAGGT 
TTCAAAAGCTACCGGAACGGCTATGATGCCCCCTTGTTTAAAATCAAGCTGGCAGACAAA 

25 GGCGATGCCGCGTTTGAAAAAGTGCATTTCGATTCGGAAACTTCAGACGGCATCAATCCG 
CTTGCTTTGGGCAGCAGCAATCTGACCTTGGAAAAATTCTCCCTAGAATGGAAAGAGGGT 
G T C GAT T AC AAC G T C AAG T T AAAC G AAC T G G T C AAT C T T G T TAG C GAT T T G C AG AT T G G C 
GCGTTTATCAA.TCCCAACGGCAGCATCGCACCTTCCAA?\ATCGAA.GTCGGCAAACTGGCT 
TTTTCAACCAAGACCGGGGAATCAGGCGCGTTTATCAACAGTGAAGGGCAGTTCCGTTTC 

30 GATACACTGGTGTACGGCGATGAAAAATACGGCCCGCTGGACATCCATATCGCTGCCGAA 
CACCTCGATGCTTCTGCCTTAACCGTATTGAAACGCAAGTTTGCACAAATTTCCGCCA7VA 
AAAATGACCGAGGAACAAATCCGCAATGATTTGATTGCCGCCGTCAAAGGAGAGGCTTCC 
GGACTGTTCACCAACAATCCCGTATTGGACATTAAAACTTTCCGATTCACGCTGCCATCG 
G G AAAAAT C GAT G T G GG C G G AAAAAT CAT G T T T AAAG AC AT G AAGAAG G AAG AT T T G AAT 

35 CAATTGGGTTTGATGCTGAAGAAAACCGAAGCCGACATCAGAATGAGTATTCCCCAAAAA 
ATGCTGGAAGACTTGGCGGTCAGTCAAGCAGGCAATATTTTCAGCGTCAATGCCGAAGAT 
GAGGCGGAAGGCAGGGCAAGTCTTGACGACATCAACGAGACCTTGCGCCTGATGGTGGAC 
AG T AC GG T T C AG AG TAT G G C AAG GG AAAAAT AT C T G AC T T T G AAC GG C G AC C AG AT T GAT 
AC T GC C AT T T C T C T G AAAAAC AAT C AG T T G A£AT T GAAC GG T AAAAC GT T G C AAAAC G AA 

40 CCGGAGCCGGATTTTGATGAAGGCGGTATGGTTTCAGAGCCGCAGCAGTAA 

NMB1345 Protein sequence 

MKKPLISVAAALLGVALGTPYYLGVKAEESLTQQQKILQETGFLTVESHQYERGWFTSME 
TTVIRLKPELLNNARKYLPDNLKTVLEQPVTLVNHITHGPFAGGFGTQAYIETEFKYAPE 

45 TEKVLERFFGKQVPASLANTVYFNGSGKMEVSVPAFDYEELSGIRLHWEGLTGETVYQKG 
FKSYRNGYDAPLFKIKLADKGDAAFEBCVHFDSETSDGINPLALGSSNLTLEKFSLEWKEG 
VDYNVKLNELVNLVTDLQIGAFINPNGSIAPSKIEVGKLAFSTKTGESGAFINSEGQFRF 
DTLVYGDEKYGPLDIHIAAEHLDASALTVLKRKFAQISAKKMTEEQIRNDLIAAVKGEAS 
GLFTNNPVLDIKTFRFTLPSGKIDVGGKIMFKDMKKEDLNQLGLMLKKTEADIRMSIPQK 

50 MLEDLAVSQAGNIFSVNAEDEAEGRASLDDINETLRLMVDSTVQSMAREKYLTLNGDQID 
TAISLKNNQLKLNGKTLQNEPEPDFDEGGMVSEPQQ 
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Selection with vaccinees sera 



Sera from the Meningococcal Reference Laboratory in Manchester has been made 
available to us. This sera has come from a clinical trial of OMV immunisation of 
5 volunteers. 



Mutants selected by vaccinee CI sera (screened once) 



The following sequences were isolated 

10 

NMB0338 (as above) 



NMB0738 DNA sequence 

ATGAAGATCGTCCTGATTAGCGGCCTGTCCGGTTCGGGCAAGTCCGTCGCACTGCGCCAA 
15 ATGGAAGATTCGGGTTATTTCTGCGTGGACAATTTGCCTTTGGAAATGTTGCCCGCGCTG 
GTGTCGTATCATATCGAACGTGCGGACGAAACCGAATTGGCGGTCAGCGTCGATGTGCGT 
TCCGGCATTGACATCGGACAGGCGCGGGAACAGATTGCCTCTCTGCGCAGACTGGGGCAC 
AGGGTTGAAGTTTTGTTTGTCGAGGCGGAAGAAAGCGTGTTGGTCCGCCGGTTTTCCGAA 
ACCAGGCGAGGACATCCTCTGAGCAATCAGGATATGACCTTGTTGGAAAGCTTAAAGAAA 
20 GAACGGGAATGGCTGTTCCCGCTTAAAGAAATCGCCTATTGTATCGACACTTCCAAGATG 
AATGCCCAACAGCTCCGCCATGCAGTCCGGCAGTGGCTGAAGGTCGAACGTACCGGGCTG 
CTGGTGATTTTGGAGTCCTTCGGGTTCAAATACGGTGTGCCGAACAACGCGGATTTTATG 
TTCGATATGCGCAGCCTGCCCAACCCGTATTACGATCCCGAGTTGAGGCCTTACACCGGT 
ATGGACAAGCCCGTTTGGGATTATTTGGACGGACAGCCGCTTGTGCAGGAAATGGTTGAC 
25 GACATCGAAAGGTTTGTTACGCATTGGTTACCGCGTTTGGAGGATGAAAGCAGGAGCTAC 
GTTACCGTCGCCATCGGTTGCACGGGAGGACAGCACCGTTCGGTCTATATTGTCGAAAAA 
CTCGCCCGAAGGTTGAAAGGGCGTTATGAATTGCTGATACGGCACAGACAGGCGCAAAAC 
CT GT C AGACCGCT AA 

30 NMB0738 Protein sequence 

MKIVLISGLSGSGKSVALRQMEDSGYFCVDNLPLEMLPALVSYHIERADETELAVSVDVR 
SGIDIGQAREQIASLRRLGHRVEVLFVEAEESVLVRRFSETRRGHPLSNQDMTLLESLKK 
EREWL FPLKE I AY C I DT S KMN AQQLRH AVRQWLKVERT G LLVI LE S FG FK YG V PNN AD FM 
FDMRSLPNPYYDPELRPYTGMDKPVWDYLDGQPLVQEMVDDIERFVTHWLPRLEDESRSY 
3 5 VT VAI GCT GGQHRS V Y I VEKL ARRLKGRYELL I RHRQ AQNL SDR 

NMB0792 NadC family (transporter) DNA sequence 

ATGAACCTGCATGCAAAGGACAAAACCCAGCATCCCGAAAACGTCGAGCTGCTCAGTGCG 
CAGAAGCCGATTACCGACTTTAAGGGCCTGCTGACCACCATTATTTCCGCCGTCGTCTGT 

40 TTCGGCATTTACCACATCCTGCCTTACAGCCCCGATGCCAATAAAGGTATCGCGCTGCTG 
ATTTTCGTTGCCGCACTTXGGTTTACCGAGGCCGTCCACATTACCGTAACCGCACTGATG 
GTGCCGATTCTCGCCGTCGTACTCGGTTTCCCCGACATGGACATCAAAAAGGCGATGGCT 
GATTTTTCCAACCCGATTATCTACATTTTTTTCGGCGGCTTCGCGCTTGCCACCGCCCTG 
CATATGCAGCGGCTGGACCGTAAAATCGCCGTCAGCCTGTTGCGCCTGTCGCGCGGCAAT 

45 ATGAAAGTGGCGGTTTTGATGTTGTTCCTCGTTACCGCCTTTCTGTCCATGTGGATCAGC 
AACACCGCCACCGCCGCGATGATGCTGCCTCTAGCAATGGGTATGCTGAGCCACCTCGAC 
CAGGAAAAAGAAC AC AAAACCT ACGTCT T CCT CCT GCT CGGCAT CGCCT AT T GCGCC AGC 
ATCGGCGGCTTGGGCACGCTCGTCGGCTCGCCGCCCAACCTGATTGCCGCCAAAGCCCTA 
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AATCTGGACTTCGTCGGCTGGATGAAGCTCGGCCTGCCGATGATGCTGTTGATTCTGCCC 
TTGATGCTGCTCTCCCTGTACGTCATCCTCAAACCTAATTTGAACGAACGCGTGGAAATC 
AAAGCCGAATCCATCCCTTGGACGCTGCACCGCGTGATCGCGCTGTTGATTTTCCTTGCC 
ACAGCCGCCGCGTGGATATTCAGCTCCAAAATCAAAACCGCCXTCGGCATTTCCAATCCC 
5 GACACCGTTATCGCCCTGAGTGCCGCCGTCGCCGTCGTCGTCTTCGGCGTGGCGCAATGG 
AAGGAAGTCGCCCGCAATACCGACTGGGGCGTGTTGATGCTCTTCGGCGGCGGCATCAGC 
CTGAGCACGCTGTTGAAAACATCCGGCGCGTCCGAAGCCTTGGGACAGCAGGTTGCCGCC 
ACCTTTTCCGGCGCGCCCGCATTTTTGGTGATACTCATCGTCGCCGCCTTCATTATTTTT 
CTGACCGAGTTCACCAGCAACACCGCCTCCGCCGCATTGCTTGTACCGATTTTCTCCGGC 
10 ATCGCTATGCAGATGGGGCTGCCCGAACAAGTCTTGGTATTCGTCATCGGCATCGGCGCA 
TCTTGTGCCTTCATGCTGCCGGTTGCCACACCGCCTAACGCGATTGTGTTCGGCACGGGC 
TTAATCAAGCAACGCGAAATGATGAATGTCGGCATACTGCTGAACATCCTCTGCGTAGTA 
TTGGTTGCTCTGTGGGCTTATGCTGTACTGATGTAA 

15 NMB0792 Protein sequence 

MNLHAKDKTQHPETSIVELLSAQKPITDFKGLLTTIISAVVCFGiyHILPYSPDANKGIALL 
IFVAALWFTEAVHITVTALMVPILAVVLGFPDMDIKKAMADFSNPIIYIFFGGFALATAL 
HMQRLDRKIAVSLLRLSRGISrMKVAVLMLFLVTAFLSMWISNTATAAMMLPLAMGMLSHIiD 
QEKBHKTYVFLLLGIAYCASIGGLGTLVGSPPNLIAAKALNLDFVGWMKLGLPMMLLILP 
20 LMLLSLYVILKP1SIL1SJERVEIKAESIPWTLHRVIALLIFLATAAAWIFSSKIKTAFGISNP 
DT VI ALS AAVAVVVFGVAQWKEVARNTDWG VLMLFGGGI S LS TLLKT S GAS E ALGQQVAA 
TFSGAPAFLVILIVAAFIIFLTEFTSNTASAALLVPIFSGIAMQMGLPEQVLVFVIGIGA 
SCAFMLPVATPPNAIVFGTGLIKQREMMNVGILLNILCWLVALWAYAVLM 

25 NMB0279 DNA sequence 

ATGCAACGACAAATCAAACTGAAAAATTGGCTTCAGACCGTTTATCCCGAACGGGACTTC 
GATCTGACTTTTGCGGCGGCGGATGCTGATTTCCGCCGCTATTTCCGTGCAACGTTTTCA 
GACGGCAGCAGTGTCGTCTGCATGGATGCACCGCCCGACAAGATGAGTGTCGCACCTTAT 
TTGAAAGTGCAGAAACTGTTTGACATGGTCAATGTGCCGCAGGTATTGCACGCGGACACG 

30 GATCTGGGGTTTGTGGTATTGAACGACTTGGGCAATACGACGTTTTTGACCGCAATGCTT 
CAGGAACAGGGCGAAACGGCGCACAAAGCCCTGCTTTTGGAGGCAATCGGCGAGTTGGTC 
GAATTGCAGAAGGCGAGCCGTGAAGGGGTTTTGCCCGAATATGACCGTGAAACGATGTTG 
CGCGAAATCAACCTGTTCCCGGAATGGTTTGTCGCAAAAGAATTGGGGCGCGAATTAACA 
TTCAAACAACGCCAACTTTGGCAGCAAACCGTCGATACGCTGCTGCCGCCCCTGTTGGCG 

35 CAGCCCAAAGTCTATGTGCACCGCGACTTTATCGTCCGCAACCTGATGCTGACGCGCGGC 
AGGCCGGGCGTTTTAGACrTCCAAGACGCGCTTTACGGCCCGATTTCCTACGATTTGGTG 
TCGCTGTTGCGCGATGCCTTTATCGAATGGGAAGAAGAATTTGTCTTGGACTTGGTTATC 
CGCTACTGGGAAAAGGCGCGGGCTGCCGGCTTGCCCGTCCCCGAAGCGTTTGACGAGTTT 
TACCGCTGGTTCGAATGGATGGGCGTGCAGCGGCACTTGAAGGTTGCAGGCATCTTCGCA 

40 CGCCTGTACTACCGCGACGGCAAAGACAAATACCGTCCGGAAATCCCGCGTTTCTTAAAC 
TATCTGCGCCGCGTATCGCGCCGTTATGCCGAACTCGCCCCGCTCTACGCGCTCTTGGTC 
GAACTGGTCGGCGATGAAGAACTGGAAACGGGCTTTACGTTTTAA 

NMB0279 Protein sequence 

45 MQRQIKLKNWLQTVYPERDFDLTFAAADADFRRYFRATFSDGSSVVCMDAPPDKMSVAPY 
LKVQKLFDMVNVPQVLHADTDLGFVVLNDLGNTTFLTAMLQEQGETAHKALLLEAIGELV 
ELQKASREGVLPEYDRETMLREINLFPEWFVAKELGRELTFKQRQLWQQTVDTLLPPLLA 
QPKVYVHRDFIVRNLMLTRGRPGVLDFQDALYGPISYDLVSLLRDAFIEWEEEFVLDLVI 
RYWEKARAAGLPVPEAFDEFYRWFEWMGVQRHLKVAGIFARLYYRDGKDKYRPEIPRFLN 

50 YLRRVSRRY AEL APLY ALLVELVGDEELET GFT F 

NMB2050 DNA sequence 

ATGGAACTGATGACTGTTTTGCTGCCTTTGGCGGCGTTGGTGTCGGGCGTGTTGTTTACA 
TGGTTGCTGATGAAGGGCCGGTTTCAGGGCGAGTTTGCCGGTTTGAACGCGCACCTGGCG 
55 GAAAAGGCGGCAAGATGTGATTTTGTCGAACAGGCACACGGCAAAACCGTGTCGGAATTG 
GCGGTGTTGGACGGGT^AATACCGGCATTTGCAGGACGAAAATTATGCTTTGGGCAACCGT 
TTTTCCGCAGCCGAAAAGCAGATTGCCCATTTGCAGGAAAAAGAGGCGGAGTCGGCGCGG 
CTGAAGCAGTCGTATATCGAGTTGCAGGAAAAGGCACAGGGTTTGGCGGTTGAAAACGAA 
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CGTTTGGCAACGCAGCTCGGACAGGAACGGAAGGCGTTTGCCGACCAATATGCCTTGGAA 
CGCCAAATCCGCCAAAGAATCGAAACCGATTTGGAAGAAAGCCGCCAAACTGTCCGCGAC 
GTGCAAAACGACCTTTCCGATGTCGGCAACCGTTTTGCCGCAGCCGAAAAACAGATTGCC 
CATTTGCAGGAAAAAGAGGCGGAAGCGGAGCGGTTGAGGCAGTCGCATACCGAGTTGCAG 
5 GAAAAGGCACAGGGTTTGGCGGTTGAAAACGAACGTTTGGCAACGCAAATCGAACAGGAA 
CGCCTTGCTTCTGAAGAGAAGCTGTCCTTGCTGGGCGAGGCGCGCAAAAGTTTGAGCGAT 
CAGTTTCAAAATCTTGCCAACACGATTTTGGAAGAAAAAAGCCGCCGTTTTACCGAGCAG 
AACCGCGAGCAGCTCCATCAGGTTTTGAACCCGCTAAACGAACGCATCCACGGTTTCGGC 
GAGTTGGTCAAGCAAACCTATGATAAAGAATCGCGCGAGCGGCTGACGTTGGAAAACGAA 

10 TTGAAACGGCTTCAGGGGTTGAACGCGCAGCTGCACAGCGAGGCAAAGGCCCTGACCAAC 
GCGCTGACCGGTACGCAGAATAAGGTTCAGGGCAATTGGGGCGAGATGATTCTGGAAACG 
GTTTTGGAAAATTCCGGCCTTCAGAAAGGGCGGGAATATGTGGTTCAGGCGGCATCCGTC 
CGAAAAGAGGAAGACGGCGGCACGCGCCGCCTCCAGCCCGACGTTTTGGTCAACCTGCCC 
GACAACAAGCAGATTGTGATTGATTCCAAGGTCTCGCTGACAGCTTATGTGCGCTACACG 

15 CAGGCGGCGGATGCGGATACGGCGGCACGCGAACTGGCGGCACACGTTGCCAGCATCCGT 
G C AC AC AT G AAAG GCTTGTCGCT G AAG GAT T AC AC C GAT T T G G AAG G T G T GAAC AC AT T G 
GATTTCGTCTTTATGTTTATCCCTGTCGAACCGGCCTACCTGTTGGCGTTGCAGAATGAC 
GCGGGCTTGTTCCAAGAGTGTTTCGACAAACGGATTATGCTGGTCGGCCCCAGTACGCTG 
CTGGCGACTTTGAGGACGGTGGCGAATATTTGGCGCAACGAACAGCAAAATCAGAACGCA 

20 CTGGCGATTGCGGACGAAGGCGGCAAGCTGTACGACAAGTTTGTCGGCTTCGTACAGACG 
CTCGAAAGCGTCGGCAAAGGCATCGATCAGGCGCAAAGCAGTTTTCAGACGGCATTCAAG 
CAACTTGCCGAAGGGCGCGGGAATCTGGTCGGACGCGCCGAGAAACTGCGTCTGTTGGGC 
GTGAAGGCAGGCAAACAACTTCAACGGGATTTGGTCGAGCGTTCCAATGAAACAACGGCG 
TTGTCGGAATCTTTGGAATACGCGGCAGAAGATGAAGCAGTCTGA 

25 

NMB2050 Protein sequence 

MELMTVLLPLAALVSGVLFTWLLMKGRFOGEFAGLNAHLAEKAARCDFVEQAHGKTVSEL 
AVLDGKYRHLQDENYALGNRFSAAEKQIAHLQEKEAESARLKQSYIELQEKAQGLAVENE 
RLATQLGQERKAFADQYALERQIRQRIETDLEESRQTVRDVQNDLSDVGNRFAAAEKQIA 

30 HLQEKEAEAERLRQSHTELQEKAQGLAVENERLATQIEQERLASEEKLSLLGEARKSLSD 
OFQNLANTILEEKSRRFTEQNREQLHQVLNPLNERIHGFGELVKQTYDKESRERLTLENE 
LKRLQGLNAQLHSEAKALTNALTGTQNKVQGNWGEMILETVLENSGLQKGREYWQAASV 
RKEEDGGTRRLOPDVLVNLPDNKQIVIDSKVSLTAYVRYTQAADADTAARELAAHVASIR 
AHMKGLSLKDYTDLEGVNTLDFVFMFIPVEPAYLLALQNDAGLFQECFDKRIMLVGPSTL 

35 LATLRTVANIWRNEQQNQNALA1ADEGGKLYDKFVGFVQTLESVGKGIDQAQSSFQTAFK 
QLAEGRGNLVGRAEKLRLLGVKAGKQLQRDLVERSNETTALSESLEYAAEDEAV 

NMB1335 CreA protein DNA sequence 

ATGAACAGACTGCTACTGCTGTCTGCCGCCGTCCTGCTGACTGCCTGCGGCAGCGGCGAA 
40 ACCGATAAAATCGGACGGGCAAGTACCGTTTTCAACATACTGGGCAAAAACGACCGTATC 
GAAGTGGAAGGATTCGACGATCCCGACGTTCAAGGGGTTGCCTGTTATATTTCGTATGCA 
AAAAAAGGCGGCTTGAAGGAAATGGTCAATTTGGAAGAGGACGCGTCCGACGCATCGGTT 
TCGTGCGTTCAGACGGCATCTTCGATTTCTTTTGACGAAACCGCCGTGCGCAAACCGAAA 
GAAGTTTTCAAACACGGTGCGAGCTTCGCGTTCAAGAGCCGGCAGATTGTCCGTTATTAC 
45 GACCCCAAACGCAAAACCTTCGCCTATTTGGTGTACAGCGATAAAATCATCCAAGGCTCG 
CCGAAAAATTCCTTAAGCGCGGTTTCCTGTTTCGGCGGCGGCATACCGCTiAACCGATGGG 
GTGCAAGCCGATACTTCCGGCAACCTGCTTGCCGGCGCCTGCATGATTTCCAACCCGATA 
GAAAATCTCGACAAACGCTGA 

5 0 NMB 1335 Protein sequence 

MNRLLLLSAAVLLTACGSGETDKIGRASTVFNILGKNDRIEVEGFDDPDVQGVACYISYA 
KKGGLKEMVNLEEDAS DASVSCVQTASSISFDETAVRKPKEVFKHGASFAFKSRQIVRYY 
DPKRKTFAYLVYSDKIIQGSPKNSLSAVSCFGGGIPQTDGVQADTSGNLLAGACMISNPI 
ENLDKR 

55 

NMB2035 DNA sequence 

ATGACCGCCTTTGTCCACACCCTTTCAGACGGCATGGAACTGACCGTCGAAATCAAGCGC 
CGTGCCAAGAAAAACCTGATTATCCGCCCCGCCGGCACACATACCGTCCGCAT CAGCGTC 
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CCACCCTGCTTCTCCGTCTCCGCTCTAAACCGCTGGCTGTATGAAAACGAAGCCGTCCTG 
CGGCAAACACTGGCGAAAACACCGCCGCCGCAAACTGCCGAAAACCGGCTGCCCGAATCC 
ATCCTCTTCCACGGCAGACAGCTTGCCCTCACCGCCCATCAAGACACGCAAATCCTGCTG 
ATGCCGTCTGAAATCCGTGTTCCCGAAGGCGCACCCGAAAAACAGCTTGCGCTGCTGCGG 
5 GACTTTTTGGAACGGCAGGCGCACAGTTACCTGATTCCCCGCCTCGAACGCCACGCCCGC 
ACCACACAACTGTTCCCCGCCTCCTCCTCGCTGACCTCTGCCAAAACCTTCTGGGGCGTG 
TGCCGCAAAACCACAGGCATACGCTTCAACTGGCGGCTGGTCGGCGCACCGGAATACGTT 
GCCGACTATGTCTGCATACACGAACTCTGCCACCTCGCCCATCCCGACCACAGCCCCGCC 
TTTTGGGAACTGACCCGCCGCTTCGCCCCCTACACGCCCAAAGCGAAACAGTGGCTCAAA 
10 ATCCACGGCAGGGAACTTTTCGCCTTAGGCTGA 

NMB2035 Protein sequence 

MTAFVHTLSDGMELTVEIKRRAKKNLIIRPAGTHTVRISVPPCFSVSALNRWLYENEAVL 
RQTLAKTPPPQTAENRLPESILFHGRQLALTAHQDTQILLMPSEIRVPEGAPEKQLALLR 
15 DFLERQAHSYLIPRLERHARTTQLFPASSSLTSAKT FWGVCRKTTGIRFNWRLVGAPEYV 
ADYVCIHELCHLAHPDHSPAFWELTRRFAPYTPKAKQWLKIHGRELFALG 

NMB1351 Fmu and Fmv protein DNA sequence 

ATGAACGCCGCACAACTCGACCATACCGCCAAAGTTTTGGCTGAAATGCTGACTTTCAAA 

20 CAGCCTGCCGATGCCGTCCTCTCCGCCTATTTCCGCGAACACAAAAAGCTCGGCAGTCAA 
GATCGCCACGAAATCGCCGAAACCGCCTTTGCCGCGCTGCGCCACTATCAAAAAATCAGT 
ACCGCCCTACGCCGTCCGCACGCGCAGCCGCGCAAAGCCGCTCTCGCCGCACTGGTTCTC 
G G C A G AAG C AC C AAC AT C AG C C AAAT C AAAG AC C T G C T T GAT G AAG AAG AAAC AG C G T T C 
CTCGGCAATTTGAAAGCCCGTAAAACCGAGTTTTCAGACAGCCTGAATACCGCCGCAGAA 

25 TTGCCGCAATGGCTGGTGGAACAACTGAAACAGCATTGGCGCGAAGAAGAAATCCTCGCT 
TTCGGCCGCAGCATCAACCAGCCTGCCCCGCTCGACATCCGCGTCAACACTTTGAAAGGC 
AAACGCGATAAAGTGCTGCCGCTGTTGCAAGCCGAAAGTGCCGATGCAGAGGCAACGCCT 
TATTCGCCTTGGGGCATCCGCCTGAAAAACAAAATCGCGCTTAACAAACACGAACTGTTT 
TTAGACGGCACACTGGAAGTCCAAGACGAAGGCAGCCAGCTGCTTGCCTTATTGGTGGGC 

30 GCAAAACGAGGCGAAATCATTGTCGATTTCTGTGCCGGTGCCGGCGGTAAAACCTTGGCT 
GTCGGTGCGCAAATGGCGAACAAAGGCAGAATCTACGCCTTCGATATCGCCGAAAAACGC 
CTTGCCAACCTCAAACCGCGTATGACCCGCGCCGGACTGACCAATATCCACCCCGAACGC 
ATCGGCAGCGAACACGATGCCCGTATCGCCCGACTGGCAGGCAAAGCCGACCGTGTGTTG 
GTGGACGCGCCCTGCTCCGGTTTGGGCACTTTACGCCGCAATCCCGACCTCAAATACCGC 

35 CAATCCGCCGAAACCGTCGCCAACCTTTTGGAACAGCAACACAGCATCCTCGATGCCGCC 
TCCAAACTGGTAAAACCGCAAGGACGTTTGGTGTACGCCACTTGCAGCATCCTGCCCGAA 
GAAAACGAGCTGCAAGTCGAACGTTTCCTGTCCGAACATCCCGAATTTGAACCCGTCAAC 
TGCGCCGAACTGCTTGCCGGTTTGAAAATCGATTTGGATACCGGCAAATACCTGCGCCTC 
AACTCCGCCCGACACCAAACCGACGGCTTCTTCGCCGCCGTATTGCAACGCAAATAA 

40 

NMB 1351 Protein sequence 

MNAAQLDHTAKVLAEMLTFKQPADAVLSAYFREHKKLGSQDRHEIAETAFAALRHYQKIS 
TALRRPHAQPRKAALAALVLGRS TNI SQIKDLLDEEETAFLGNLKARKTEFS DSLNTAAE 
LPQWLVEQLKQHWREEEILAFGRSINQPAPLDIRVNTLKGKRDKVLPLLQAESADAEATP 
45 YSPWGIRLKNKIALNKHELFLDGTLEVQDEGSQLLALLVGAKRGEIIVDFCAGAGGKTLA 
VGAQMANKGRIYAFDIAEKRLANLKPRMTRAGLTNIHPERIGSEHDARIARLAGECADRVL 
VDAPCSGLGTLRRNPDLKYRQSAETVANLLEQQHSILDAASKLVKPQGRLVYATCSILPE 
ENELQVERFLSEHPEFEPVNCAELLAGLKIDLDTGKYLRLNSARHQTDGFFAAVLQRK 

50 NMB 1 574 IlvC DNA sequence 

ATGCAAGTCTATTACGATAAAGATGCCGATCTGTCCCTAATCAAAGGCAAAACCGTTGCC 
ATCATCGGTTACGGTTCGCAAGGTCATGCCCATGCCGCCAACCTGAAAGATTCGGGTGTA 
AACGTGGTGATTGGTCTGCGCCAAGGTTCTTCTTGGAAAAAAGCCGAAGCAGCCGGTCAT 
GTCGTCAAAACCGTTGCTGAAGCGACCAAAGAAGCCGATGTCGTTATGCTGCTGCTGCCT 
55 GACGAAACCATGCCTGCCGTCTATCACGCCGAAGTTACAGCCAATTTGAAAGAAGGCGCA 
ACGCTGGCATTTGCACACGGCTTCAACGTGCACTACAACCAAATCGTTCCGCGTGCCGAC 
TTGGACGTGATTATGGTTGCCCCCAAAGGTCCGGGCCATACCGTACGCAGTGAATACAAA 
CGCGGCGGCGGCGTGCCTTCTCTGATTGCCGTTTACCAAGACAATTCCGGCAAAGCCAAA 
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GACATCGCCCTGTCTTATGCGGCTGCCAACGGCGGCACCAAAGGCGGTGTGATTGAAACC 
ACTTTCCGCGAAGAAACCGAAACCGATCTGTTCGGCGAACAAGCCGTATTGTGCGGCGGC 
GTGGTCGAGTTGATCAAGGCGGGTTTTGAAACCCTGACCGAAGCCGGTTACGCGCCTGAA 
ATGGCTTACTTCGAATGTCTGCACGAAATGAAACTGATCGTTGACCTGATTTTCGAAGGC 
5 GGT AT T G C C AAT AT G AAC TAG T C CAT T T CC AAC AAT G CG GAG T AC GG C GAAT AC G T T AC C 
GGCCCTGAAGTGGTCAATGCTTCCAGCAAAGAAGCCATGCGCAATGCCCTGAAACGCATT 
CAAACCGGCGAATACGCAAAAATGTTTATCCAAGAGGGTAATGTCAACTATGCGTCTATG 
ACTGCCCGCCGCCGTCTGAATGCCGACCACCAAGTTGAAAAAGTCGGCGCACAACTGCGT 
GCCATGATGCCTTGGATTACTGCCAACAAATTGGTTGACCAAGACAAAAACTGA 

10 

NMB1574 Protein sequence 

MQVYYDKDADLSLIKGKTVAI I GYGSQGHAHAANLKDSGVNVVI GLRQG S S WKKAE AAGH 
VVKTVAEATKEADVVMLLLPDETMPAVYHAEVTANLKEGATLAFAHGFNVHYNQIVPRAD 
LDVIMVAPKGPGHTVKSEYKRGGGVPSLIAVYQDNSGKAKDIALSYAAANGGTKGGVIET 
15 TFREETETDLFGEQAVLCGGVVELIKAGFETLTEAGYAPEMAYFECLHEMKLIVDLIFEG 
GIANMNYS I SNNAEYGE YVTGPEVVNAS SKEAMRNALKRI QTGEYAKMFIQEGNVNYASM 
TARRRLNADHQVEPCVGAQLRAMMPWITANKLVDQDKN 

NMB1298 rsuA DNA sequence 

20 AT G AAAC T TAT C AAAT AC C T G C AAT AT C AAG GC AT AG G AAG C C GC AA G C AGT G C C AAT GG 
CTGATTGCCGGCGGTTATGTTTTCATCAACGGAACCTGCATGGACGACACCGATGCAGAC 
ATCGATTCCTCATCCGTCGAAACGTTGGATATTGACGGGGAAGCAGTAACCGTCGTTCCC 
G AAC C C TAT T T C T AC AT CAT G C T C AAC AAG C C T G AAG AT T AC GAAAC T T C GC AC AAAC C C 
AAGCACTACCGCAGCGTATTCAGCCTGTTCCCCGACAATATGCGGAACATCGATATGCAG 

25 GCGGTCGGCAGGCTGGATGCAGATACGACCGGCGTATTGCTGATTACCAACGACGGCAAA 
CTGAACCACAGCCTGACTTCGCCGAGCAGAAAAATTCCCAAGCTGTACGAAGTAACGCTC 
AAACACCCCACAGGAGAAACGCTCTGCGAAACCTTGAAAAACGGCGTGCTGCTCCACGAC 
GAAAACGAAACCGTTTGTGCCGCCGATGCCGTTTTGAAAAACCCGACCACCCTGCTGCTG 
ACCATTACCGAAGGAAAATACCACCAAGTCAAACGCATGATCGCCGCCGCCGGCAACCGC 

30 GTGC71ACACCTTCATCGCCGGCGATTCGCACATCTGGAAACAGAAAACCTCAAACCCGGG 
GAATGGAAATTTATCGAATGTCCAAAATTCTGA 

NMB1298 Protein sequence 

MKLIKYLQYQGIGSRKQCQWLIAGGYVFINGTCMDDTDADIDSSSVETLDIDGEAVTVVP 
35 EPYFYIMLNKPEDYETSHKPKHYRSVFSLFPDNMRNIDMQAVGRLDADTTGVLLITNDGK 
LNHSLTSPSRKIPKLYEVTLKHPTGETLCETLKNGVLLHDENETVCAADAVLKNPTTLLL 
TITEGKYHQVKRMIAAAGNRVQHLHRRRFAHLETENLKPGEWKFIECPKF 

NMB1856 Lys R family (transcription regulator) DNA sequence 

40 ATGAAAACCAATTCAGAAGAACTGACCGTATTTGTTCAAGTGGTGGAAAGCGGCAGCTTC 
AGCCGTGCGGCGGAGCAGTTGGCGATGGCAAATTCTGCCGTAAGCCGCATCGTCAAACGG 
CTGGAGGAAAAGTTGGGTGTGAACCTGCTCAACCGCACCACGCGGCAACTCAGTCTGACG 
GAAGAAGGCGCGCAATATTTCCGCCGCGCGCAGAGAATCCTGCAAGAAATGGCAGCGGCG 
GAAACCGAAATGCTGGCAGTGCACGAAATACCGCAAGGCGTGTTGAGCGTGGATTCCGCG 

45 ATGCCGATGGTGCTGCATCTGCTGGCGCCGCTGGCAGCAAAATTCAACGAACGCTATCCG 
CATATCCGACTTTCGCTCGTTTCTTCCGAAGGCTATATCAATCTGATTGAACGCAAAGTC 
GATATTGCCTTACGGGCCGGAGAATTGGACGATTCCGGGCTGCGTGCACGCCATCTGTTT 
GACAGCCGCTTCCGCGTAATCGCCAGTCCTGAATACCTGGCAAAACACGGCACGCCGCAA 
TCTACAGAAGAGCTTGCCGGCCACCAATGTTTAGGCTTCACCGAACCCGGTTCTCTAAAT 

50 ACATGGGCGGTTTTAGATGCGCAGGGAAATCCCTATAAGATTTCACCGCACTTTACCGCC 
AGCAGCGGTGAAATCTTACGCTCGTTGTGCCTTTCAGGTTGCGGTATTGTTTGCTTATCA 
GATTTTTTGGTTGACAACGACATCGCTGAAGGAAAGTTAATTCCCCTGCTCGCCGAACAA 
ACCTCCGATAAAACACACCCCTTTAATGCTGTTTATTACAGCGATAAAGCCGTCAATCTC 
CGCTTACGCGTATTTTTGGATTTTTTAGTGGAGGAACTGGGAAACAATCTCTGTGGATAA 

55 

NMB1856 Protein sequence 

MKTNSEELTVFVQVVESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 
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EEGAQYFRRAQRILQEMAAAETEMLAVHEIPQGVLSVDSAMPMVLHLLAPLAAKFNERYP 
HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 
STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIVCLS 
D FL VDNDI AEGKL I PLL AEQT S DKTH P FNAV Y Y S DKAVNLRLRVFLD FLVEELGNNLCG 

5 

NMB01 19 DNA sequence 

AT GAT G AAG GAT T T G AAT T T GAGC AAC AGC C T G T T C AAAG GC T AC AAC G AC AAAC AT G G C 
TTAATGATTTGTGGCTATGAATGGGGTTGGAGTAAAGCCGATGAGGCTGCTTATGTAGCA 
G G T G AAT AC AAAC T C C C T G AAAAC AAAAT C G AC CAT AC AT T T G C AAAC AAAT C C C T C TAT 

10 T T C GG AG AG C AGG C AAAAAAG T GGC G T T AC GAC AAT AC GAT AAAAAAT T GG T T T G AAAT G 
TGGGGACACCCCTTAGACGAAAATGGATTGGGCGGTGCATTTGAAAAATCCCTGGTTCAA 
AC C AAC TGGGCTGC T AC AC AG G G C AAC AC TAT C GAC AAT C C C GAC AAG T T C AC AC AAC C C 
GAGCACATCGATAATTTTCTCTACCACATCGAAAAACTGCGTCCGAAAGTCATCCTCTTC 
ATGGGCAGCAGGTTGGCGGATTTTCTGAACAACCAAAATGTACTGCCACGCTTCGAGCAG 

15 TTGGTCGG T AAG C AG AC C AAAC C GC T G GAG AC GG T G C AAAAAG AAT T T G ACGG T AC AC G T 
TTCAATGTCAAATTCCAATCGTTTGAAGATTGCGAAGTCGTCTGCTTTCCCCATCCCAGT 
GCCAGTCGCGGTCTATCTTACGATTACATCGCCTTGTTTGCGCCTGAAATGAACCGGATT 
T TAT C G GAC T T T AAAAC AAC AC G C GGAT T C AAAT AA 

20 NMBO 1 1 9 Protein sequence 

MMKDLNLSNSLFKGYNDKHGLMICGYEWGWSKADEAAYVAGEYKLPENKIDHTFANKSLY 
FGEQAKKWRYDNTIKNWFEMWGHPLDENGLGGAFEKSLVQTNWAATQGNTIDNPDKFTQP 
EHIDNFLYHIEKLRPKVILFMGSRLADFLNNQNVLPRFEQLVGKQTKPLETVQKEFDGTR 
FNVKFQSFEDCEVVCFPHPSASRGLSYDYIALFAPEMNRILSDFKTTRGFK 

25 

NMB1705 rfaK DNA sequence 

ATGGAAAAAGAATTCAGGATATTAAATATCGTATCGGCCAAGATTTGGGGTGGAGGCGAA 
CAATATGTCTATGATGTTTCAAAAGCATTGGGGCTTCGGGGCTGCACAATGTTTACCGCC 
GTCAATAAAAATAATGAATTGATGCACAGGCGATTTTCCGAAGTTTCTTCCGTTTTCACA 

30 ACGCGCCTTCACACGCTCAACGGGCTGTTTTCGCTCTACGCACTTACCCGCTTTATCCGG 
AAAAAC CG C AT T T C C C AC C T GAT GAT ACAC AC C G G C AAAAT TGCCGCCT TAT C CAT AC T T 
TTGAAAAAACTGACCGGGGTGCGCCTGATATTTGTCAAACATAATGTCGTCGCCAACAAA 
ACCGATTTTTACCACCGCCTGATACAGAAAAACACAGACCGCTTTATTTGCGTTTCCCGT 
CTGGTTTACGATGTGCAAACCGCCGACAATCCCTTTAAAGAAAAATACCGGATTGTTCAT 

35 AACGGTATCGATACCGGCCGTTTCCCTCCCTCTCAAGAAAAACCCGACAGCCGTTTTTTT 
ACCGTCGCCTACGCCGGCAGGATCAGTCCAGAAAAAGGATTGGAAAACCTGATTGAAGCC 
T GT G T GAT AC T G CAT C G GAAAT AT C C T C AAAT C AG GC T C AAAT T G GC AG GGG AC G GAC AT 
CCGGATTATATGTGCCGCCTGAAGCGGGACGTATCTGCTTCAGGAGCAGAACCATTTGTT 
TCTTTTGAAGGGTTTACCGAAAAACTTGCTTCGTTTTACCGCCAAAGCGATGTCGTGGTT 

40 TTGCCCAGCCTCGTCCCGGAGGCATTCGGTTTGTCATTATGCGAGGCGATGTACTGCCGA 
ACGGCGGTGATTTCCAATACTTTGGGGGCGCAAAAGGAAATTGTCGAACATCATCAATCG 
GGGATTCTGCTGGACAGGCTGACACCTGAATCTTTGGCGGACGAAATCGAACGCCTCGTC 
TTGAACCCTGAAACGAAAAACGCACTGGCAACGGCAGCTCATCAATGCGTCGCCGCCCGT 
TTTACCAT CAACCATACCGCCGACAAATTATTGGATGCAATATAA 

45 

NMB1705 Protein sequence 

MEKEFRILNIVSAKIWGGGEQYVYDVSKALGLRGCTMFTAVNKNNELMHRRFSEVSSVFT 
TRLHTLNGLFSLYALTRFIRKNRISHLMIHTGKIAALSILLKKLTGVRLIFVKHNVVANK 
TDFYHRLIQKNTDRFICVSRLVYDVQTADNPFKEKYRIVHNGIDTGRFPPSQEKPDSRFF 
50 T VAYAGRI S PEKGLENL I EAC VILHRKYPQI RLKLAGDGHP D YMCRLKRDVS AS GAEPFV 
SFEGFTEKLASFYRQSDVWLPSLVPEAFGLSLCEAMYCRTAVISNTLGAQKEIVEHHQS 
GILLDRLT PE SLADE I ERLVLNPETKNALATAAHQCVAARFT I NHTADKLLDAI 

NMB2065 Hemic protein DNA sequence 

55 ATGCAGGAACAGAATCGGAAACCAAGTTTTCCCATAGTGATGTTGCTGGTGTCGGTTGCC 
CTGTGGATAGCGTCTTTATCCAATGTTGCATTTTATTTGGGCAATCATGGAAGCATGGAG 
GGTTTGACCGTTTTGATTTTGGGGTCGATATTTGCTTCTTTGGATATCAGGTATTGTGCG 
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GTCTATGCGAATTATGTTTGGTTGGCGGCCATTGTTTTGCTGGCGTTGCGGAAGAAGGTC 
GTGCCTGTCCATGCGGCACTTTGGGGCTTGGCGTTGGTGGCTTTCAGTGTGAAAGCCGTA 
T AC G T C GAT G AAGC AG G G AAT AC AT C G GAT AT T G T GC G C T ACGG T GC AG GAT T T TAT T T G 
TGGTATGCCGCATTTGCGGTTGCCACCATCGGTACGTTTGCCGGAAAGAATAAGGAAAGA 
5 AAAGCCGCATCAGCGGCAGACGGGATAAAAATGACGTTTGATAAATGGTTGGGCTTGTCA 
AAACTGCCTAAAAATGAAGCAAGAATGCTGCTACAATATGTTTCGGAATATACGCGCGTG 
CAGTTGTTGACGCGGGGCGGGGAAGAAATGCCGGACGAAGTCCGACAGCGGGCGGACAGG 
CTGGCGCAACGCCGTCTGAACGGCGAGCCGGTTGCCTATATTTTAGGTGTGCGCGAATTT 
TATGGCAGACGCTTTACAGTCAATCCGAGCGTGCTGATTCCGCGCCCCGAAACCGAACAT 

10 TTGGTCGAAGCCGTATTGGCGCGCCTGCCCGAAAACGGGCGCGTGTGGGATTTGGGGACG 
GGCAGCGGCGCGGTTGCCGTAACCGTCGCGCTCGAACGCCCCGATGCGTTTGTGCGCGCA 
TCCGACATCAGCCCGCCCGCCCTTGAAACGGCGCGGAAAAATGCGGCGGATTTGGGCGCG 
CGGGTCGAATTTGCACACGGTTCGTGGTTCGACACCGATATGCCGTCTGAAGGGAAATGG 
GACATCATCGTGTCCAACCCGCCCTATATCGAAAACGGCGATAAACATTTGTTGCAAGGC 

15 GATTTGCGGTTTGAGCCGCAAATCGCGCTGACCGACTTTTCAGACGGCCTAAGCTGCATC 
CGCACCTTGGCGCAAGGCGCGCCCGACCGTTTGGCGGAAGGCGGTTTTTTATTGCTGGAA 
CACGGTTTCGATCAGGGCGCGGCGGTGCGCGGCGTGTTGGCGGAGAATGGTTTTTCAGGA 
GTGGAAACCCTGCCGGATTTGGCGGGTTTGGACAGGGTTACGCTGGGGAAGTATATGAAG 
CAT T T GAAAT AA 

20 

NMB2065 Protein sequence 

MOEQNRKPSFPIVMLLVSVALWIASLSNVAFYLGNHGSMEGLTVLILGSIFASLDIRYCA 
VYANYVWLAAIVLLALRKKVVPVHAALWGLALVAFSVKAVyVDEAGNTSDIVRYGAGFYL 
W YAAFAVAT I GT FAGKNKERKAASAADGI KMT FDKWLGLSKLPKNEARMLLQ YVSE YTRV 
25 QLLTRGGEEMPDEVRQRADRLAQRRLNGEPVAYILGVREFYGRRFTVNPSVLIPRPETEH 
LVEAVLARLPENGRVWDLGTGSGAVAVTVALERPDAFVRASDISPPALETARKNAADLGA 
RVEFAHGSWFDTDMPSEGKWDIIVSNPPYIENGDKHLLQGDLRFEPQIALTDFSDGLSCI 
RTLAQGAPDRLAEGGFLLLEHGFDQGAAVRGVLAENGFSGVETLPDLAGLDRVTLGKYMK 
HLK 

30 

Mutants selected by vacinee's 17 D sera (Screened once only) 
NMB0339 DNA sequence 

ATGGACAACGAATTGTGGATTATCCTGCTGCCGATTATCCTTTTGCCCGTCTTCTTCGCG 

35 ATGGGCTGGTTTGCCGCCCGCGTGGATATGAAAACCGTATTGAAGCAGGCAAAAAGCATC 
CCTTCGGGATTTTATAAAAGCTTGGACGCTTTGGTCGACCGCAACAGCGGGCGCGCGGCA 
AGGGAGTTGGCGGAAGTCGTCGACGGCCGGCCGCAATCGTATGATTTGAACCTCACCCTC 
G G C AAAC T T TAG CGCCAGCGTGGC G AAAAC GAC AAAG C CAT C AAC AT AC AC C G G AC AAT G 
CTCGATTCTCCCGATACGGTCGGCGAAAAGCGCGCGCGCGTCCTGTTTGAATTGGCGCAA 

40 AACTACCAAAGTGCGGGGTTGGTCGATCGTGCCGAACAGATTTTTTTGGGGCTGCAAGAC 
GGTA7VAATGGCGCGTGAAGCCAGACAGCACCTGCTCAATATCTACCAACAGGACAGGGAT 
TGGGAAAAAGCGGTTGAAACCGCCCGGCTGCTCAGCCATGACGATCAGACCTATCAGTTT 
GAAATCGCCCAGTTTTATTGCGAACTTGCCCAAGCCGCGCTGTTCAAGTCCAATTTCGAT 
GTCGCGCGTTTCAATGTCGGCAAGGCACTCGAAGCCAACAAAAAATGCACCCGCGCCAAC 

45 ATGATTTTGGGCGACATCGAACACCGACAAGGCAATTTCCCTGCCGCCGTCGAAGCCTAT 
GCCGCCATCGAGCAGCAAAACCATGCATACTTGAGCATGGTCGGCGAGAAGCTTTACGAA 
G C C TAT GC C GC GC AG GG AAAAC C T G AAG AAG G C T T G AAC C G T C T GAC AG GAT AT AT GC AG 
ACGTTTCCCGAACTTGACCTGATCAATGTCGTGTACGAGAAATCCCTGCTGCTTAAGTGC 
GAGAAAGAAGCCGCGCAAACCGCCGTCGAGCTTGTCCGCCGCAAGCCCGACCTTAACGGC 

50 GTGTACCGCCTGCTCGGTTTGAAACTCAGCGATATGAATCCGGCTTGGAAAGCCGATGCC 
GACATGATGCGTTCGGTTATCGGACGGCAGCTACAGCGCAGCGTGATGTACCGTTGCCGC 
AACTGCCACTTCAAATCCCAAGTCTTTTTCTGGCACTGCCCCGCCTGCAAC7\AATGGCAG 
ACGTTTACCCCGAATAAAATCGAAGTTTAA 

55 NMB0339 Protein sequence 

MDNELWI I LL P 1 1 LLP VFFAMGWFAARVDMKT VLKQAKS I P S G F YKS LDAL VDRN S GRAA 
RELAEVVDGRPQSYDLNLTLGKLYRQRGENDKAINIHRTMLDSPDTVGEKRARVLFELAQ 
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NYQSAGLVDRAEQIFLGLQDGKMAREARQHLLNIYQQDRDWEKAVETARLLSHDDQTYQF 
EIAQFYCELAQAALFKSNFDVARFNVGKALEANKKCTRANMILGDIEHRQGNFPAAVEAY 
AAIEQQNHAYLSMVGEKLYEAYAAQGKPEEGLNRLTGYMQTFPELDLINWYEKSLLLKC 
EKEAAQTAVELVRRKPDLNGVYRLLGLKLSDMNPAWKADADMMRSVIGRQLQRSVMYRCR 
5 NCHFKSQVFFWHCPACNKWQTFTPNKIEV 

Selection with patient's sera 



We have a collection of acute and convalescent sera available to us for screening. 
10 This is from individuals infected with different serogroup of TV. meningitidis. 

Screens have been performed with acute (A) or convalescent (C) sera. The period 
between the acute infection and collection of sera was from 2 weeks to 3 months. 



NMB0401 putA DNA sequence 

15 ATGTTTCATTTTGCATTTCCGGCACAAACTGCCCTGCGCCAAGCGATAACCGATGCCTAC 
C G C C G T AAT G AAAT C G AAG C C G TAG AG GAT AT G T T G C AAC G T GC AC AG AT GAG C G AC GAA 
GAGCGCAACGCCGCCTCCGAGCTTGCCCGCCGTTTGGTTACCCAAGTCCGCGCCGGCCGC 
ACCAAAGCCGGCGGCGTGGATGCGCTGATGCACGAGTTTTCACTCTCCAGCGAAGAAGGC 
ATCGCGCTGATGTGTCTGGCAGAAGCCCTGCTGCGTATCCCCGACAACGCCACGCGCGAC 

20 C G C C T GAT T GC C G AC AAG AT T T C AGAC GGC AAC T G G AAAAG C CAT T T GAAC AAC AG C C C T 
TCCCTCTTCGTCAATGCTGCCGCCTGGGGCCTGCTGATTACCGGCAAACTGACCGCCACA 
AACGACAAACAAATGAGTTCCGCACTCAGCCGCCTGATCAGCAAAGGCGGCGCACCGCTC 
ATCCGCCAAGGCGTAAATTACGCCATGCGGCTTCTGGGCAAACAGTTCGTAACCGGACAG 
ACCATTGAAGAAGCCCTGCAAAACGGCAAAGAACGCGAAAAAATGGGCTACCGCTTCTCC 

25 TTCGATATGTTGGGCGAAGCCGCCTACACCCAAGCCGATGCCGACCGCTACTACCGCGAC 
TATGTCGAAGCCATCCACGCCATCGGCAAAGATGCGGCAGGACAAGGCGTTTACGAAGGT 
AACGGTATTTCCGTCAAACTTTCCGCCATCCATCCGCGCTACTCGCGCACCCAACACGGC 
CGCGTGATGGGCGAACTGTTGCCGCGCCTGAAAGAGCTGTTCCTTTTGGGTAAAAAATAC 
GATATCGGTATCAACATCGATGCCGAAGAAGCCAACCGTCTGGAGCTGTCTTTGGATTTG 

30 ATGGAGGCTTTGGTTTCAGACCCTGACTTGGCTGGCTACAAAGGTATCGGTTTCGTTGTC 
CAAGCCTACCAAAAACGTTGTCCGTTCGTTATCGACTACCTGATCGACCTTGCCCGCCGC 
AACAACCAAAAACTAATGATCCGCCTCGTCAAAGGCGCGTATTGGGACAGCGAAATCAAA 
TGGGCGCAAGTGGACGGCTTGAACGGCTATCCGACCTACACCCGCAAAGTCCACACCGAC 
ATCTCCTACCTCGCCTGCGCGCGCAAACTGCTTTCCGCGCAAGACGCGGTATTCCCGCAA 

35 TTTGCCACCCACAACGCCTACACTTTGGGCGCAATCTACCAAATGGGTAAAGGCAAAGAT 
TTTGAACACCAATGCCTGCACGGTATGGGCGAAACCCTGTACGACCAAGTCGTCGGCCCG 
CAAAACTTAGGCCGCCGCGTGCGCGTGTACGCCCCAGTCGGCACACACGAAACCCTGCTC 
GCCTACTTGGTGCGCCGCCTGTTGGAAAACGGCGCGAACTCGTCTTTCGTCAACCAAATC 
G T C G AT G AAAAC A T C AG CAT C G AC AC GC T CAT C C G C AG C C C G T T C G AC AC CAT C G C C GAA 

40 CAAGGCATCCACCTGCACAACGCCCTGCCGCTGCCGCGCGATTTGTACGGCAAATGCCGT 
CTGAACTCGCAAGGCGTGGACTTGAGCAACGAAAACGTATTGCAGCAGCTTCAAGAACAG 
ATGAACAZIAGCCGCCGCGCAAGACTTCCACGCCGCATCCATCGTCAACGGCAAAGCCCGC 
GATGTCGGCGAAGCGCAACCGATTAAAAACCCTGCCGACCACGACGACATCGTCGGCACA 
GTCAGCTTTGCCGATGCCGCGCTTGCCCAAGAAGCGGTTGGCGCAGCCGTTGCCGCGTTC 

45 CCCGAATGGAGTGCGACACCTGCCGCCGAACGCGCCGCCTGCCTGCGCCGTTTTGCCGAT 
TTGCTGGAGCAGCACACCCCAGCACTGATGATGCTTGCCGTGCGCGAAGCAGGCAAAACG 
CTGAACAACGCCATTGCCGAAGTGCGCGAAGCCGTCGATTTCTGCCGCTACTACGCAAAC 
GAAGCCGAACATACCCTGCCTCAAGACGCAAAAGCCGTCGGCGCGATTGTCGCCATCAGC 
CCGTGGAACTTCCCGCTCGCCATCTTTACCGGCGAAGTCGTTTCCGCATTGGCGGCAGGC 

50 AACACCGTCATCGCCAAACCCGCCGAACAAACCAGCCTGATTGCCGGTTATGCCGTTTCC 
CTCATGCACGAAGCCGGCATCCCGACTTCCGCCCTGCAACTCGTCCTCGGCGCAGGCGAC 
GTGGGTGCGGCATTGACCAACGATGCCCGCATCGGCGGCGTGATTTTCACCGGCTCGACC 
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GAAGTGGCGCGCCTGATCAACAAAGCCCTTGCCAAACGCGGCGACAATCCCGTCCTGATT 
GCCGAAACCGGCGGACAAAACGCCATGATTGTCGATTCCACCGCACTTGCCGAGCAAGTC 
TGCGCCGACGTATTGAACTCCGCCTTCGACAGCGCGGGACAACGCTGCTCCGCCCTGCGC 
ATTTTGTGCGTCCAAGAAGACGTTGCCGACCGTATGCTCGACATGATCAAAGGCGCTATG 
5 GACGAACTCGTCGTCGGCAAACCGATTCAGCTCACTACCGATGTCGGCCCCGTCATCGAT 
G C C G AAGC AC AGC AAAACC T G T T GAAC C ACAT C AAC AAAAT GAAAG G T G T T GC C AAG T C C 
TACCACGAAGTCAAAACCGCCGCCGATGTCGATTCCAAAAAATCCACGTTCGTTCGCCCC 
ATCCTGTTTGAATTGAACAACCTCAACGAACTGCAACGCGAAGTCTTCGGTCCCGTCCTG 
CACGTCGTCCGCTACCGCGCCGACGAACTCGACAACGTCATCGACCAAATCAACAGCAAA 

10 GGCTACGCCCTGACCCACGGCGTACACAGCCGCATCGAAGGCACGGTACGCCACATCCGC 
AGCCGCATCGAAGCCGGCAACGTTTACGTCAACCGCAACATCGTCGGCGCAGTCGTCGGC 
GTACAGCCCTTCGGCGGACACGGTCTGTCCGGCACAGGCCCCAAAGCAGGCGGTTCGTTC 
TACCTGCAAAAACTGACCCGCGCCGGCGAATGGGTTGCCCCGACCCTGAGCCAAATCGGA 
CAGGCGGACGAAGCCGCACTCAAACGCCTCGAAGCACTGGTTCACAAACTACCGTTCAAC 

15 GCCGAAGAGAAAAAAGCCGCAGCGGCCGCTTTGGGACACGCCCGCATCCGCACCCTGCGC 
CGTGCCGAAACCGTCCTTACCGGACCGACCGGCGAGCGCAACAGCATCTCATGGCACGCG 
CCCAAACGCGTTTGGATACACGGCGGCAGCACGGTTCAAGCCTTTGCCGCACTGACCGAA 
CTTGCCGCCTCCGGCATACAGGCAGTGGTCGAACCCGACAGCCCCTTGGCTTCCTACACT 
GCCGACTTGGAAGGTCTGCTGCTGGTCAACGGCAAACCCGAAACCGCCGGCATCAGCCAC 

20 GTTGCCGCCCTGTCGCCTTTGGACAGCGCGCGCAAACAGGAACTTGCCGCCCACGACGGC 
GCACTCATCCGCATCCTCCCTTCGGAAAACGGACTCGACATCCTGCAAGTGTTTGAAGAA 
ATCTCTTGCAGCGTCAACACCACAGCCGCCGGCGGCAACGCCAGCCTGATGGCGGTCGCC 
GACTGA 

25 NMB0401 Protein sequence 

MFHFAFPAQTALRQAITDAYRRNEIEAVQDMLQRAQMSDEERNAASELARRLVTQVRAGR 
T KAGG VDALMHE FS L S S EEG I ALMCLAE ALLRI P DNAT RDRL I ADKI S DGNWKS HLNNS P 
S L F VNAAAWGLL I T GKLT ATN DKQMS S ALS RL I S KGGAPL I RQG VN YAMRLLGKQ FVT GQ 
T I EEALQNGKEREKMG YRFS FDMLGEAAYTQADADRY YRD YVEAI HAI GKDAAGQGVYEG 

30 NGISVKLSAIHPRYSRTQHGRVMGELLPRLKELFLLGKKYDIGINIDAEEANRLELSLDL 
ME AL VS D P DLAG YKGI GFVVQAYQKRC P FVT D YL I DL ARRNNQKLMI RL VKGAYWD S E I K 
WAQ VDGLNG YP T YTRKVH T D I S YLAC ARKLL S AQ DAVF PQ FATHNAY TLGAI YQMGKGKD 
FEHQCLHGMGETLYDQVVGPQNLGRRVRVYAPVGTHETLLAYLVRRLLENGANSSFVNQI 
VDENISIDTLIRSPFDTIAEQGIHLHNALPLPRDLYGKCRLNSQGVDLSNENVLQQLQEQ 

35 MNKAAAQDFHAASIVNGKARDVGEAQPIKNPADHDDIVGTVSFADAALAQEAVGAAVAAF 
PEWSATPAAERAACLRRFADLLEOHTPALMMLAVREAGKTLNNAIAEVREAVDFCRYYAN 
EAEHTLPQDAKAVGAIVAISPWNFPLAIFTGEVVSALAAGNTVIAKPAEQTSLIAGYAVS 
LMHE AGI P T S ALQLVLGAGDVGAALTNDARI GGVI FT G S TE VARL I NKAL AKRG DN P VL I 
AETGGQNAMIVDSTALAEQVCADVLNSAFDSAGQRCSALRILCVQEDVADRMLDMIKGAJyi 

40 DEL VVGKP I QLT T D VG P VI DAE AQQNLLNH INKMKG VAKS YHE VKTAAD VD S KKS T FVRP 
I LFELNNLNELQRE VFGPVLH VVRYRADELDNVI DQINS KG YALTHGVHS RI EGT VRHI R 
SRIEAGNVYVNRNIVGAVVGVQPFGGHGLSGTGPKAGGSFYLQKLTRAGEWVAPTLSQIG 
QADEAALKRLEALVHKL P FNAEEKKAAAAALGHARI RT LRRAE T VLT G PT GERN S I S WHA 
PKRVWIHGGSTVQAFAALTELAASGIQAVVEPDSPLASYTADLEGLLLVNGKPETAGISH 

45 VAALSPLDSARKQELAAHDGALIRILPSENGLDILQVFEEISCSVNTTAAGGNASLMAVA 
D 

NMB1335 CreA 



50 DNA and Protein sequences given above 



NMB1467 PPX DNA sequence 

ATGACCACCACCCCCGCAAACGTCCTCGCCTCCGTCGATTTGGGTTCCAACAGTTTCCGC 
C T C C AG AT T T GC G AAAAC AAC AACGGAC AAT T AAAAG T CAT C GAT T CG T T C AAAC AG AT G 
55 GTGCGCTTCGCCGCCGGACTGGACGAACAGAAAAATCTGAGTGCCGCTTCCCAAGAACAG 
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GCTTTGGACTGTCTGGCAAAATTCGGCGAACGCCTGCGCGGCTTCCGCCCTGAACAGGTA 
CGCGCCGTGGCAACCAACACATTCCGCGTTGCCAAAAACATCGCAGATTTCCTTCCCAAA 
GCCGAAGCGGCATTGGGTTTCCCCATCGAAATCATCGCCGGGCGCGAAGAGGCGCGGCTG 
ATTTATACCGGCGTGATCCACACCCTCCCCCCGGGCGGCGGCAAAATGCTGGTTATCGAC 
5 ATCGGCGGCGGTTCGACAGAATTTGTCATCGGCTCGACGCTGAATCCCGACATTACCGAA 
AGCCTGCCCTTGGGCTGCGTAACCTACAGCCTGCGCTTCTTCCAAAACAAAATCACCGCC 
AAAGACTTCCAATCTGCCATTTCCGCCGCCCGCAACGAAATCCAGCGTATCAGCAAAAAT 
ATGAGGCGCGAAGGTTGGGATTTCGCCGTCGGCACATCGGGTTCGGCAAAATCCATCCGC 
GACGTGCTTGCCGCCGAAATGCCCCAAGAGGCGGACATTACCTACAAAGGCATGCGCGCC 

10 CTCGCCGAACGCATCATCGAAGCCGGTTCGGTCAAAAAAGCCAAATTTGAAAACCTGAAA 
CCGGAACGCATCGAAGTTTTTGCCGGCGGACTTGCCGTGATGATGGCGGCGTTTGAGGAA 
ATGAAACTCGACAGGATGACCGTAACCGAAGCCGCCCTGCGCGACGGCGTGTTTTACGAT 
TTGATCGGGCGCGGTTTAAACGAAGATATGCGCGGACAAACGGTTGCCGAGTTCCAACAC 
CGCTACCACGTCAGCCTCAATCAGGCGAAACGCACCGCCGAGACCGCGCAAACCTTTATG 

15 GACAGCCTCTGCCACGCTAAAAACGTTACAGTTCAAGAGCTTGCCTTGTGGCAACAGTAT 
CTCGGACGCGCCGCCGCGCTGCACGAAATCGGTTTGGACATCGCCCACACCGGCTATCAC 
AAGCATTCCGCCTACATCCTCGAAAACGCCGATATGCCGGGTTTCTCACGCAAAGAACAG 
ACCATACTTGCCCAACTGGTCATCGGTCATCGCGGCGATATGAAAAAAATGAGCGGCATC 
ATCGGCACCAACGAAATGTTGTGGTATGCCGTTTTGTCCCTGCGCCTTGCCGCACTGTTC 

20 TGCCGTTCGCGCCAAGACCTGTCTTTCCCGAAAAATATGCAGTTGCGCACGGATACGGAA 
AGCTGCGGCTTCATCCTGCGTATTGACAGGGAATGGCTGGAACGCCATCCCCTGATTGCC 
GACGCATTGGAATATGAAAGCGTCCAATGGCAAAAAATCAATATGCCGTTCAAAGTCGAG 
GCCGTCTGA 

25 NMB 1467 Protein sequence 

MTTTPANVLASVDLGSNSFRLQICENNNGQLKVIDSFKOMVRFAAGLDEQKNLSAASQEQ 
ALDCLAKFGERLRGFRPEQVRAVATNTFRVAKNIADFLPKAEAALGFPIEIIAGREEARL 
IYTGVIHTLPPGGGPCMLVIDIGGGSTEFVIGSTLNPDITESLPLGCVTYSLRFFQNKITA 
KDFQSAISAARNEIQRISKNMRREGWDFAVGTSGSAKSIRDVLAAEMPQEADITYKGMRA 
30 LAERIIEAGSVKKAKFENLKPERIEVFAGGLAVMMAAFEEMKLDRMTVTEAALRDGVFYD 
LIGRGLNEDMRGQTVAEFQHRYHVSLNQAKRTAETAQTFMDSLCHAKNVTVQELALWQQY 
LGRAAALHEIGLDIAHTGYHKHSAYILENADMPGFSRKEQTILAQLVIGHRGDMKKMSGI 
I G TNEMLWYAVLS LRLAAL FCRS RQDLS FPKNMQLRT DTESCGFILRI DREWLERH PL I A 
DALEYESVQWQKINMPFKVEAV 

35 

NMB2056 HemK 

ATGAACGGTAAATACTACTACGGCACAGGCCGCCGCAAAAGTTCAGTGGCTCGTGTATTC 
CTGATTAAAGGTACAGGTCAAATCATCGTAAACGGTCGTCCCGTTGACGAATTCTTCGCA 
40 CGGGAAACCAGCCGAATGGTTGTTCGCCAACCCTTGGTTCTGACTGAAAACGCCGAATCT 
TTCGACATCAAAGTCAATGTTGTTGGCGGCGGCGAAACCGGCCAGTCCGGCGCAATCCGC 
CACGGCATTACCCGTGCCCTGATCGACTTCGATGCCGCGTTGAAACCCGCCTTGTCTCAA 
GCTGGTTTTGTTACCCGCGATGCCCGCGAAGTCGAACGTAAAAAACCGGGTCTGCGCAAA 
GCACGCCGTGCAAAACAATTCTCCAAACGTTAA 

45 

NMB2056 Protein sequence 

MNGKYYYGTGRRKSSVARVFLIKGTGQIIVNGRPVDEFFARETSRMVVRQPLVLTENAES 
FDIBCVNWGGGETGQSGAIRHGITRALIDFDAALKPALSQAGFVTRDAREVERKKPGLRK 
ARRAKQFSKR 

50 

NMB0808 DNA sequence 

ATGTCCGCCCTCCTCCCCATCATCAACCGCCTGATTCTGCAAAGCCCGGACAGCCGCTCG 
GAACTTGCCGCCTTTGCAGGCAAAACACTGACCCTGAACATTGCCGGGCTGAAACTGGCG 
GGACGCATCACGGAAGACGGTTTGCTCTCGGCGGGAAACGGCTTTGCAGACACCGAAATT 
55 ACCTTCCGCAACAGCGCGGTACAGAAAATCCTCCAAGGAGGCGAACCCGGGGCGGGCGAC 
ATCGGGCTCGAAGGCGACCTCATCCTCGGCATCGCGGTACTGTCCCTGCTCGGCAGCCTG 
CGTTCCCGCGCATCGGACGAATTGGCACGGATTTTCGGCACGCAGGCAGACATCGGCAGC 
CGTGCCGCC GACAT CGG AC AC GGC AT C AAACAAAT C GGC AG GAAC AT C G C C G AAC AAAT C 
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GGCGGATTTTCCCGCGAATCCGAGTCCGCAAACATCGGCAACGAAGCCCTTGCCGACTGC 
CTCGACGAAATAAGCAGACTGCGCGACGGCGTGGAACGCCTCAACGAACGCCTCGACCGG 
C T C G AAC G C G AC AT T T G GAT AG AC T AA 

5 NMB0808 Protein sequence 

MSALLP I INRLILQS PDSRSELAAFAGKTLTLNIAGLKLAGRI TEDGLLSAGNGFADTEI 
TFRNSAVQKILQGGEPGAGDIGLEGDLILGIAVLSLLGSLRSRASDELARI FGTQADIGS 
RAADIGHGIKQIGRNIAEQIGGFSRESESANIGNEALADCLDEISRLRDGVERLNERLDR 
LERDIWID 

10 

NMB0774 upp DNA sequence 

ATGAACGTTAATGTTATCAACCATCCGCTCGTCCGCCACAAATTAACCCTGATGAGGGAG 
GCGGATTGCAGCACCTACAAATTCCGGACGCTTGCCACCGAGCTGGCGCGCCTGATGGCA 
TACGAGGCAAGCCGTGATTTTGAAATCGAAAAATACCTTATCGACGGATGGTGCGGTCAG 

15 ATTGAAGGCGACCGCATCAAGGGCAAAACATTGACCGTCGTTCCCATACTGCGTGCAGGT 
TTGGGTATGCTTGACGGTGTGCTCGACCTGATTCCGACTGCCAAAATCAGTGTAGTCGGA 
CTGCAGCGCGACGAAGAAACGCTGAAGCCTATTTCCTATTTTGAGAAATTTGTGGACAGT 
ATGGACGAACGTCCGGCTTTGATTATCGATCCTATGCTGGCGACAGGCGGTTCGATGGTT 
GCCACCATCGACCTTTTGAAAGCCAAGGGCTGCAAAAATATCAAGGCACTGGTGCTGGTT 

20 GCCGCGCCCGAGGGTGTGAAGGCGGTCAACGACGCGCACCCTGACGTTACGATTTACACC 
GCCGCGCTCGACAGCCACTTGAACGAGAACGGCTACATCATCCCCGGCTTGGGCGATGCG 
GGCGACAAGATTTTCGGCACGCGCTAA 

NMB0774 Protein sequence 

25 MNVNVI NH PL VRHKL TLMRE ADC S T YKFRTLAT ELARLMAYE AS RD FE I EK YL I DGWCGQ 
IEGDRIKGKTLTWPILRAGLGMLDGVLDLIPTAKISVVGLQRDEETLKPISYFEKFVDS 
MDE RP AL 1 1 D PML AT G G S M VAT I DLLKAKG CKN I KAL VL VAAP E G VKAVN DAH P DVT I Y T 
AALDS HLNENG YI I PGLGDAGDKI FGTR 

30 NMA0078 putative integral membrance protein DNA sequence 

TTGGCGTTTACTTTAATGCGTCGCGCCATGATACGTAAAATGCCCTATACGGAAGATATG 
CGCCCAGGCGATACCGCTAATCCTTATGGTGCGTCCAAAGCGATGGTGGAACGGATGTTA 
ACCGACATCCAAAAAGCCGATCCGCGCTGGAGCATGATTTTGTTGCGTTATTTCAATCCG 
ATTGGCGCGCATGAAAGCGGCTTGATTGGCGAGCAGCCAAACGGCATCCCGAATAATTTG 

35 TTGCCTTATATCTGCCAAGTGGCGGCAGGCAAACTGCCGCAATTGGCGGTATTTGGCGAT 
GACTACCCTACCCCCGACGGCACGGGGATGCGTGACTATATTCATGTGATGGATTTGGCA 
GAAGGCCATGTCGCGGCTATGCAGGCAAAAAGTAATGTAGCAGGCACGCATTTGCTGAAC 
TTAGGCTCCGGCCGCGCTTCTTCGGTGTTGGAAATCATCCGCGCATTTGAAGCAGCTTCG 
GGTTTGACGATTCCGTATGAAGTCAAACCGCGCCGTGCCGGTGATTTGGCGTGCTTCTAT 

40 GCCGACCCTTCCTATACAAAGGCGCAAATCGGCTGGCAAACCCAGCGTGATTTAACCCAA 
AT GAT GG AAG AC T CAT GGCGCTGGGT GAG T AAT AAT C C G AAT G G C TAG GAC GAT T AA 

NMA0078 Protein sequence 

MAFTLMRRAMIRKMPYTEDMRPGDTANPYGASK 
45 IGAHESGLIGEQPNGIPNNLLPYICQVAAGKLPQLAVFGDDYPTPDGTGMRDYIHVMDLA 
EGHVAAMQAKSNVAGTHLLNLGSGRASSVLEIIRAFEAASGLTIPYEVKPRRAGDLACFY 
ADPSYTKAQIGWQTQRDLTQMMEDSWRWVSNNPNGYDD 

NMB0337 Branched-chain amino acid aminotransferase DNA sequence 

50 ATGAGCAGACCCGTACCCGCCGTATTCGGCAGCGTTTTTCACAGTCAAATGCCCGTCCTC 
GCCTACCGCGAAGGCAAATGGCAGCCGACCGAATGGCAATCTTCCCAAGACCTCTCCCTC 
GCACCGGGCGCGCACGCCCTGCACTACGGCAGCGAATGTTTCGAGGGACTGAAAGCCTTC 
CGTCAGGCAGACGGCAAAATCGTGCTGTTCCGTCCGACTGCCAATATCGCGCGTATGCGG 
CAAAGTGCGGACATTTTGCACCTGCCGCGCCCCGAAACCGAAGCTTATCTTGACGCGCTA 

55 ATCAAATTGGTCAAACGTGCCGCCGATGAAATTCCCGATGCGCCTGCCGCCCTGTACCTG 
CGTCCGACCTTAATCGGTACCGATCCCGTTATCGGCAAGGCCGGTTCTCCTTCCGAAACC 
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GCCCTGCTGTATATTTTGGCTTCCCCCGTCGGCGACTATTTCAAAGTCGGATCGCCCGTC 
AAAATTTTGGTGGAAACCGAACACATCCGCTGCGCCCCGCATATGGGCCGCGTCAAATGC 
GGCGGCAACTACGCTTCCGCCATGCACTGGGTGCTGAAGGCGAAAGCCGAATATGGCGCA 
AATCAAGTCCTGTTCTGCCCGAACGGCGACGTGCAGGAAACCGGCGCGTCCAACTTTATC 
CTGATTAACGGCGATGA^ATCATTACCAAACCGCTGACCGACGAGTTTTTGCACGGCGTA 
ACCCGCGATTCCGTACTGACGGTTGCCAAAGATTTGGGCTATACCGTCAGCGAACGCAAT 
TTCACGGTTGACGAACTCAAAGCTGCGGTGGAAAACGGTGCGGAAGCCATTTTGACCGGT 
ACGGCAGCCGTCATCTCGCCCGTTACTTCCTTCGTCATCGGCGGCAAAGAAATCGAAGTG 
AAAAGCCAAGAACGCGGCTATGCCATCCGTAAGGCGATTACCGACATCCAGTATGGTTTG 
GCGGAAGACAAATACGGCTGGCTGGTTGAAGTGTGCTGA 

NMB0337 Protein sequence 

MSRPVPAVFGSVFHSQMPVLAYREGKWQPTEWQSSQDLSLAPGAHALHYGSECFEGLPCAF 
RQADGKIVLFRPTANIARMRQSADILHLPRPETEAYLDALIKLVKRAADEIPDAPAALYL 
RPTLIGTDPVI GKAGS PSETALLYILAS PVGDYFKVGS PVKILVETEHIRCAPHMGRVKC 
GGNYASAMHWVLKAKAEYGANQVLFCPNGDVQETGASNFILINGDEIITKPLTDEFLHGV 
TRDSVLTVAKDLGYTVSERNFTVDELKAAVENGAEAILTGTAAVISPVTSFVIGGKEIEV 
KSQERGYAIRKAITDIQYGLAEDKYGWLVEVC 

NMB0191 Par A family protein DNA sequence 

ATGAGTGCGAACATCCTTGCCATCGCC7^ATCAGAAGGGCGGTGTGGGCAAAACGACGACG 
ACGGTAAATTTGGCGGCTTCGCTGGCATCGCGCGGCAAACGCGTGCTGGTGGTCGATTTG 
GATCCGCAGGGCAATGCGACGACGGGCAGCGGCATCGACAAGGCGGGTTTGCAGTCCGGC 
GTTTATCAGGTCTTATTGGGCGATGCGGACGTGCAGTCGGCGGCGGTACGCAGCAAAGAG 
GGCGGATACGCTGTGTTGGGTGCGAACCGCGCGCTGGCCGGCGCGGAAATCGAACTGGTG 
CAGGAAATCGCCCGGGAAGTGCGTTTGAAAAACGCGCTCAAGGCAGTGGAAGAAGATTAC 
GACTTTATCCTGATCGACTGCCCGCCTTCGCTGACGCTGTTGACGCTTAACGGGCTGGTG 
GCGGCGGGCGGCGTGATTGTGCCGATGTTGTGCGAATATTACGCGCTGGAAGGGATTTCC 
GATTTGATTGCGACCGTGCGCAAAATCCGTCAGGCGGTCAATCCCGATTTGGACATCACG 
GGCATCGTGCGCACGATGTACGACAGCCGCAGCAGGCTGGTTGCCGAAGTCAGCGAACAG 
TTGCGCAGCCATTTCGGGGATTTGCTTTTTGAAACCGTCATCCCGCGCAATATCCGCCTT 
GCGGAAGCGCCGAGCCACGGTATGCCGGTGATGGCTTACGACGCGCAGGCAAAGGGTACC 
AAGGCGTATCTTGCCTTGGCGGACGAGCTGGCGGCGAGGGTGTCGGGGAAATAG 

NMB0191 Protein sequence 

MSANILAIANQKGGVGKTTTTVNLAASLASRGKRVLVVDLDPQGNATTGSGIDKAGLQSG 
VYQVLLGDADVQ SAAVRS KE G G YAVLG ANRAL AGAE I ELVQE I AREVRLKNALKAVEE D Y 
DFILIDCPPSLTLLTLNGLVAAGGVIVPMLCEYYALEGISDLIATVRKIRQAVNPDLDIT 
GIVRTMYDSRSRLVAEVSEQLRSHFGDLLFETVIPRNIRLAEAPSHGMPVMAYDAQAKGT 
KAYLALADELAARVS GK 

NMB1710 Glutamate dehydrogenase(gdhA) DNA sequence 

ATGACTGACCTGAACACCCTGTTTGCCAACCTCAAACAACGCAATCCCAATCAGGAGCCG 
TTCCATCAGGCGGTTGAAGAAGTCTTCATGAGTCTCGATCCGTTTTTGGCAAAAAATCCG 
AAATACACCCAGCAAAGCCTGCTGGAACGCATCGTCGAACCCGAACGCGTCGTGATGTTC 
CGCGTAACCTGGCAGGACGATAAAGGGCAAGTCCAAGTCAACCGGGGCTACCGCGTGCAA 
ATGAGTTCCGCCATCGGTCCTTACAAAGGCGGCCTGCGCTTCCATCCGACCGTCGATTTG 
GGCGTATTGAAATTCCTCGCTTTTGAACAAGTGTTCAAAAACGCCTTGACCACCCTGCCT 
ATGGGCGGCGGCAAAGGCGGTTCCGACTTCGACCCCAAAGGCAAATCCGATGCCGAAGTA 
ATGCGCTTCTGCCAAGCCTTTATGACCGAACTCTACCGCCACATCGGCGCGGACACCGAT 
GTTCCGGCCGGCGACATCGGCGTAGGCGGGCGCGAAATCGGCTACCTGTTCGGACAATAC 
AAAAAAATCCGCAACGAGTTTTCTTCCGTCCTGACCGGCAAAGGTTTGGAATGGGGCGGC 
AGCCTCATCCGTCCCGAAGCGACCGGCTACGGCTGCGTCTATTTCGCCCAAGCGATGCTG 
CAAACCCGCAACGATAGTTTTGAAGGCAAACGCGXCCTGATTTCCGGCTCCGGC7\ATGTG 
GCGCAATACGCCGCCGAAAAAGCCATCCAACTGGGTGCGAAAGTACTGACCGTTTCCGAC 
TCCAACGGCTTCGTCCTCTTCCCCGACAGCGGTATGACCGAAGCGCAACTCGCCGCCTTG 
ATCGAATTGAAAGAAGTCCGCCGCGAACGCGTTGCCACCTACGCCAAAGAGCAAGGTCTG 
CAATACTTTGAAAAACAAAAACCGTGGGGCGTCGCCGCCGAAATCGCCCTGCCCTGCGCG 
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ACCCAGAACGAATTGGACGAAGAAGCCGCCAAAACCCTGTTGGCAAACGGCTGCTACGTC 
GTTGCCGAAGGTGCGAATATGCCGTCGACTTTGGGCGCGGTCGAGCAATTTATCAAAGCC 
GGCATCCTCTACGCCCCGGGAAAAGCCTCCAATGCCGGCGGCGTGGCAACTTCAGGTTTG 
G AAAT GAG C C AAAAC GCC AT CCGCCTGTCTT GG AC T C G T G AAG AAG T C G ACC AAC G C C T G 
5 T T C G G CAT CAT GC AAAGC AT C C AC G AAT C C T G T C T G AAAT AC G G C AAAG T C G G C G AC AC A 
GTAAACTACGTCAATGGTGCGAACATTGCCGGTTTCGTCAAAGTTGCCGATGCGATGCTG 
GCGCAAGGCTTCTAA 

NMB 1710 Protein sequence 

10 MTDLNTLFANLKQRNPNQEPFHQAVEEVFMSLDPFLAKNPKYTQQSLLERIVEPERWMF 
RVTWQDDKGQVQVNRGYRVQMSSAIGPYKGGLRFHPTVDLGVLKFLAFEQVFKNALTTLP 
MGGGKGGSDFDPKGKSDAEVMRFCQAFMTELYRHIGADTDVPAGDIGVGGREIGYLFGQY 
KKIRNEFSSVLTGKGLEWGGSLIRPEATGYGCVYFAQAMLQTRNDSFEGKRVLISGSGNV 
AQYAAEKAIQLGAKVLTVSDSNGFVLFPDSGMTEAQLAALIELKEVRRERVATYAKEQGL 

15 QYFEKQKPWGVAAEIALPCATQNELDEEAAKTLLANGCYVVAEGANMPSTLGAVEQFIKA 
G ILYAPGKASNAGGVAT S GLEMS QNAI RLSWTREEVDQRL FGIMQS I HES CLKYGKVGDT 
VN Y VN GAN I AG F VKVADAML AQ G F 

NMB0062 Glucose- 1 -phosphate thymidylytransferase(rfbA-l) DNA sequence 

20 ATG71AAGGCATCATACTGGCAGGCGGCAGCGGCACGCGCCTCTACCCCATCACGCGCGGC 
GTATCCAAACAGCTCCTGCCCGTGTACGACAAACCGATGATTTATTACCCCTTGTCGGTT 
TTGATGCTGGCGGGAATCCGCGATATTTTGGTGATTACCGCGCCTGAAGACAACGCCTCT 
TTCAAACGCCTGCTTGGCGACGGCAGCGATTTCGGCATTTCCATCAGTTATGCCGTGCAA 
CCCAGTCCGGACGGCTTGGCACAGGCATTTATCATCGGCGAAGAATTTATCGGCAACGAC 

25 AATGTTTGCTTGGTTTTGGGCGACAATATTTTTTACGGTCAGTCGTTTACGCAAACATTG 
AAACAGGCGGCAGCGCAAACGCACGGCGCAACCGTGTTTGCTTATCAGGTCAAAAACCCC 
GAACGTTTCGGCGTGGTTGAATTTAACGAAAACTTCCGCGCCGTTTCCATCGAAGAAAAA 
CCGCAACGGCCCAAATCCGATTGGGCGGTAACCGGCTTGTATTTCTACGACAACCGCGCC 
GTCGAGTTCGCCAAACAGCTCAAACCGTCCGCACGCGGCGAATTGGAAATTACCGACCTC 

30 AACCGGATGTATTTGGAAGACGGCTCGCTCTCCGTTCAAATATTGGGACGCGGTTTCGCG 
TGGCTGGACACCGGCACCCACGAGAGCCTGCACGAAGCCGCTTCATTCGTCCAAACCGTG 
CAAAATATCCAAAACCTGCACATCGCCTGCCTCGAAGAAATCGCTTGGCGCAACGGTTGG 
CTTTCCGATGAAAAACTGGAAGAATTGGCGCGCCCGATGGCGAAAAACCAATACGGCCAA 
TATTTGCTGCGCCTGTTGAAAAAATAA 

35 

NMB0062 Protein sequence 

MKGIILAGGSGTRLYPITRGVSKQLLPVYDKPMIYYPLSVLMLAGIRDILVITAPEDNAS 
FKRLLGDGS DFGI SIS YAVQPS PDGLAQAFI I GEE FI GNDNVCLVLGDNI FYGQS FTQTL 
KQAAAQTHGATVFAYQVKNPERFGVVEFNENFRAVSIEEKPQRPKSDWAVTGLYFYDNRA 
40 VEFAKQLKPSARGELEITDLNRMYLEDGSLSVQILGRGFAWLDTGTHESLHEAAS FVQTV 
QNI QNLHI ACLEE I AWRNGWL S DEKLEELARPMAKNQ YGQ YLLRLLKK 

NMB 15 83 Imidazoleglycerol-phosphate dehydratase(hisB) DNA sequence 

AT G AAT T T GAC T AAAAC AC AAC G C C AAC T GC ACAAC T T T C T G AC C C T C GC C C AAG AAGC A 
45 GGTTCGCTGTCCAAGCTCGCCAAACTCTGCGGCTACCGTACCCCCGTCGCACTCTACAAA 
C T C AAAC AAC G C C T T G AAAAG C AG G C AG AAG AC C C AG AT G C AC G C G G CAT C C G T C C C AG C 
CTGATGGCATUVACTCGAT^AAlCACACCGGCAAACCCAAAGGCTGGCTCGACAGAAAACAC 
CGCGAACGCACTGTCCCCGAAACCGCCGCAGAAAGCACCGGAACTGCCGAAACCCAAATT 
GCCGAAACCGCATCTGCTGCCGGCTGCCGCAGCGTTACCGTCAACCGCAATACCTGCGAA 
50 ACCCAAATCACCGTCTCCATCAACCTCGACGGCAGCGGCAAAAGCAGGCTGGATACCGGC 
G T AC CCTTCCTC G AAC AC AT GAT C GAT C AAAT C G C C C G C C AC G G CAT GAT T GAC AT C GAC 
AT C AGC T G CAAAG G C GAC C T GCACAT C GAC GAC C AC CACAC C GC C GAAGAC AT C G G CAT C 
AC AC T C G GAC AAG C AAT C C G GC AG GC AC T C GGC G AC AAAAAAG G CAT C C G C C G T T AC G G A 
CATTCCTACGTCCCGCTCGACGAAGCCCTCAGCCGCGTCGTCATCGACCTTTCCGGCCGC 
55 CCCGGACTCGTGTACAACATCGAATTTACCCGCGCACTAATCGGACGTTTCGATGTCGAT 
T T G T T T G AAGAAT T T T T C C AC G GC AT C G T C AACC AC AG TAT GAT GAC C C T GCACAT C GAC 
AACCTCAGCGGCAAAAACGCCCACCATCAGGCGGAAACCGTATTCAAAGCCTTCGGGCGC 
GCCCTGCGTATGGCAGTCGAACACGACCCGCGCATGGCAGGACAGACCCCCTCGACCAAA 
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GGCACGCTGACCGCATAA 

NMB1583 Protein sequence 

MNliT KTQRQLHN fltlaqeag sl sklaklcgyrt p val yklkqrlekqaed P DARG I RP s 

5 LMAKLEKHTGKPKGWLDRKHRERTVPETAAESTGTAETQIAETASAAGCRSVTVNRNTCE 
TQITVSINLDGSGKSRLDTGVPFLEHMIDQIARHGMIDIDISCKGDLHIDDHHTAEDIGI 
TLGQAIRQALGDKKG I RRYGHSY VPLDEALS RVVI DLS GRPGLVYN I E FTRALI GRFDVD 
LFEEFFHGIVNHSMMTLHJDNLSGKNAHHQAETVFKAFGRALRMAVEHDPRMAGQTPSTK 
GTLTA 

10 

Schedule of SEP ID Nos 

SEQ ID No Sequence 



1 


NMB0341 


DNA 


2 


NMB0341 


Protein 


3 


NMB0338 


DNA 


4 


NMB0338 


Protein 


5 


NMB1345 


DNA 


6 


NMB1345 


Protein 


7 


NMB0738 


DNA 


8 


NMB0738 


Protein 


9 


NMB0792 


DNA 


J- vj 


NMB079 7 


Protein 


11 


NMB0279 


DNA 


12 


NMB0279 


Protein 


13 


NMB2050 


DNA 


14 


NMB2050 


Protein 


15 


NMB1335 


DNA 


16 


NMB1335 


Protein 


17 


NMB2035 


DNA 


18 


NMB2035 


Protein 


19 


NMB1351 


DNA 


20 


NMB1351 


Protein 


21 


NMB1574 


DNA 


22 


NMB1574 


Protein 


23 


NMB1298 


DNA 
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24 NMB 1298 Protein 

25 NMB1856 DNA 

26 NMB1856 Protein 

27 NMB0119 DNA 

28 NMB 01 1 9 Protein 

29 NMB 1705 DNA 

30 NMB 1705 Protein 

31 NMB2065 DNA 

32 NMB2065 Protein 

33 NMB0339 DNA 

34 NMB0339 Protein 

35 NMB0401 DNA 

36 NMB0401 Protein 

37 NMB 1467 DNA 

38 NMB 1467 Protein 

39 NMB2056 DNA 

40 NMB2056 Protein 

41 NMB0808 DNA 

42 NMB0808 Protein 

43 NMB0774 DNA 

44 NMB0774 Protein 

45 NMA0078 DNA 

46 NMA0078 Protein 

47 NMB0337 DNA 

48 NMB0337 Protein 

49 NMB0191 DNA 

50 NMB0191 Protein 

51 NMB1710 DNA 

52 NMB1710 Protein 

53 NMB0062 DNA 

54 NMB0062 Protein 

55 NMB 1583 DNA 
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NMB1583 Protein 



